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ABSTRACT 

We present new high-precision measurements of the opacity of the intergalactic and circumgalactic medium 
(IGM, CGM) at (z) = 2.4. Using Voigt profile fits to the full Lyman a and Lyman (3 forests in 15 high- 
resolution high-S/N spectra of hyperluminous QSOs, we make the first statistically robust measurement of 
the frequency of absorbers with H I column densities 14 < log(A^Hi/cm"^) < 17.2. We also present the first 
measurements of the frequency distribution of H I absorbers in the volume surrounding high-z galaxies (the 
CGM, 300 pkpc), fin ding that the incidence of absorbers in the CGM is much higher than in the IGM. In 
agreement with Rudi e et al.l (120121) . we find that there are fractionally more high-A^ni absorbers than Iow-A^hi 
absorbers in the CGM compared to the IGM, leading to a shallower power law fit to the CGM frequency 
distribution. We use these new measurements to calculate the total opacity of the IGM and CGM to hydrogen- 
ionizing photons, finding significantly higher opacity than most previous studies, especially from absorbers 
with log(A^Hi/cm"^) < 17.2. Reproducing the opacity measured in our data as well as the incidence of ab- 
sorbers with log(A^Hi/cm"^) > 17.2 requires a broken power law parameterization of the frequency distribution 
with a break near A^hi~ 10'^ cm"^. We compute new estimates of the mean free path (A^fp) to hydrogen- 
ionizing photons at Zem = 2.4, finding Amfp = 147 ± 15 Mpc when considering only IGM opacity. If instead, 
we consider photons emanating from a high-z star-forming galaxy and account for the local excess opacity due 
to the surrounding CGM of the galaxy itself, the mean free path is reduced to A^nfp = 121 ± 15 Mpc. These 
Amfp measurements are smaller than recent estimates and should inform future studies of the metagalactic UV 
background and of ionizing sources at z « 2- 3. 
Subject headings: intergalactic medium — quasars: absorption lines 



1. INTRODUCTION 

Observations of the Lyman a forest imprinted upon the 
spectra of background QSOs provide a rare window into 
the nature and evolution of baryonic structures in the uni- 
verse. With the advent of the theory of cold dark matter 
(CDM), it was recognized that self-gr avitating halos could 
be the sites of un collapsed Lya clouds dUmemura & Ikeuchil 
119851; iReesI 1198 6). In the modern version of this model, 
the Lya forest simply represents the array of density fluc- 
tuations present in the earl y universe and evolved under 
gravity to the prese nt day (iTytled Il987t ICen et al.l 119941 
iHemquist et al.l[l996h . modulo the effect of the formation of 
stars, black holes, and galaxies which appear to have polluted 
a large fraction of the cosmic hydrogen with metals , even 
at early time s (ISongaila & Cowi3 Il996t JDave et al.1 119981; 
Schave et al] 120031; ISimcoe et al.1 120041; iRvan- Weber et all 
2009; Becker et al."2009^. 'Simcoe"201 U ISimcoe et al.l l201 U 
Becker et al. 201 1 , 2012) and led to t he ion ization of the vast 
majority of the gas (Gunn & Peter sonll 19651) . 

There is an extensive literature concerned with the mea- 
surement of cosmological parameters using observations of 
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the Lyg forest [for a review, see iRauchI (119981) and iMeiksinI 






(I2009h 1. Considerable progress has been made in the last 
several decades in measuring the detailed properties of the 
IGM and their implications for cosmology, structure forma- 
tion, reionization, and the metagalactic background. 

One of the most fundamental of these measurements is the 
number of absorbers as a function of column density (A^hi) 
and redshift that can be seen along a given path through the 
Universe. This measurement has required significant obser- 
vational effort, in a large part due to the very large dynamic 
range of the measurements (10'^ < A^hi ^ 10^^ cm"^) which 
have necessitated surveys optimized for particular ranges in 
A^Hi- This requirement is easily understood by considering the 
curve of growth of the Lya line of neutral hydrogen. 

The most common absorbers with log(A^Hi/cm"^) < 14 are 
optically thin in the Lyman a transition and are thus on the lin- 
ear part of the curve of growth. As such, their properties can 
be measured with a few high-resolution ( ^10 km s~|) high - 
signal to noise ratio (S/N ^ 50) spectra (' Kim et al] 120021) . 
At the highest column densities, log(A^Hi/cm"^) > 20.3, the 
Lorentzian wings of the absorption line in Lya become ob- 
vious against the continuum of the background source. Such 
rare pockets of predominantly neutral hydrogen, commonly 
referred to as Damped Lyman a absorbers (DLAs), can be 
identified and measured to high statistical precision with a 
large number of low to moderate resolution spectra of back- 
ground QSOs as has been done usi ng (e.g.) the QSO sa mple 
within the Sloan Digital Sky Survey (jNoterdaeme et al.l2 009). 

Absorbers with intermediate Nhi, 14 < log(A^Hi/cm"^) < 
20, lie on the flat part of the curve of growth thus requiring 
different techniques. Systems with A^hi> 10'^'^ cm"^ have 
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an optical depth to hydrogen-ionizing photons r > 1 , and so 
may be recognized by the strong breaks they produce in back- 
ground QSO spectra at the Lyman hmit (9 12 A) in their rest 
frame. These so-called Lyman Limit Systems (LLS) can thus 
be easily discovered using low resolution spectra; however, a 
large sample is needed because they are comparatively rare. 
For studies at z < 2.6, surveys of LLSs require space-based 
observations as the Lyman limit shifts below the atmospheric 
cutoff at UV wavelengths. However, over the last 30 years 
with a combination of ground and space based observations, 
the distribut ion of LLSs at 2 < z < 3 has been wel l character- 
ized (Tvtle ij|1982t ISa rgent et al. 1989; Stengler- Larrea et alj 
[T995;,Ribaudo et aljf201 k.O'Meara et al..,20 12>. 

Finally there is the historically most-problematic range, 
14 < log(A^Hi/cm"^) < 17.2. These intermediate-A^Hi ab- 
sorbers are saturated in Lya but have relatively low opac- 
ity to hydrogen-ionizing photons and therefore weak "Lyman 
breaks." For this reason, observational constraints on their 
statistical incidence have been approximate at best. 

In this paper, we address these intermediate-A^Hi absorbers 
using Voigt profile fits to all-available higher-order Lyman se- 
ries transitions. We take advantage of the increased dynamic 
range of the measurements afforded by the decreased oscilla- 
tor strength of the Ly/3,7, etc. transitions to accurately mea- 
sure the frequency distribution of these absorbers, f{N,X), for 
the first time in a statistically robust sample. In our derivation 
of an analytic representation of f{N,X) we place emphasis on 
finding a method which best reproduces the measured opacity 
of absorbers to hydrogen ionizing sources. 

These new measurements are crucial to our understanding 
of the opacity of the universe to hydrogen-ionizing photons. 
While individually absorbers with log(A^Hi/cm"^) < 17.2 have 
relatively low optical depths at their Lyman limit, they are 
much more common than LLSs, and therefore their ensemble 
contributes nearly half of the IGM's opacity. Here, we present 
the most precise measurements of the continuum opacity of 
absorbers that are optically thin at the Lyman limit. 

In this paper, we also attempt to account for the fact that 
galaxies are locate d in regions of relatively high gas density. 
iRudie et aP (120 12h clearly demonstrated that typical galaxies 
at z « 2.3 have large quantities of H I surrounding them to 300 
physical kpc (pkpc), and that there is excess H I compared to 
random places in the IGM to ^2 physical Mpc (pMpc) . In the 
case that galaxies such as those studied in Rudie et al.l ( 12012b 
are significant contributors to the ionizing background, the ef- 
fect of their surrounding circumgalactic gas distribution is an 
important consideration. In this work, we attempt to separate 
the probability that an ionizing photon will escape the ISM of 
its galaxy from the probability that such a photon, after escap- 
ing to 50 pkpc, will also escape the enhanced opacity found 
in its circumgalactic medium (CGM). 

We begin in Section|2]with a description of the unique data 
that enable these very precise A^hi measurements. In Section 
[5] we present the new A^hi frequency distribution measure- 
ments. Section]?] considers the frequency distribution within 
the CGM of galaxies and Section ]5] quantifies the total opac- 
ity of the IGM and the environment of the source galaxy. Up- 
dated measurements of the mean free path (A^fp) to hydrogen- 
ionizing photons are presented in Section ]6] along with their 
implications. The paper is summarized in Section]?] 

Throughout this paper we assume a A-CDM cosmology 
with Ho = 70 km s"' Mpc"', Q^ = 0.3, and Qa = 0.7. All dis- 
tances are expressed in physical (proper) units unless stated 
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Figure 1. Histogram of the pathlength sampled as a function of red- 
shift. The path-length-weighted average redshift of the sample is 

z = 2.37. 

otherwise. We use the abbreviations pkpc and pMpc to in- 
dicate physical units. At the path-weighted mean redshift of 
the absorber sample ((z) = 2.37), the age of the universe is 2.7 
Gyr, the look-back time is 10.7 Gyr, and 8.2 pkpc subtends 
one arcsecond on the sky. 

2. DATA AND ANALYSIS 

The data presented in this paper constitute a subset of the 
Keck Baryonic Stmcture Survey (KBSS). The KBSS was de- 
signed to detect and characterize the gaseous distribution sur- 
rounding star-forming galaxies at 2 < z < 3 during the peak 
of cosmic star formation and black hole growth. The data 
include rest-frame UV spectra of 2188 star-forming galaxies 
located in 15 fields surrounding the lines of sight to hyper- 
luminous (mv — 15.5 - 17) QSOs. The present work focuses 
primarily on the H I absorption line statistics resulting from 
analysis of high-resolution, high signal-to-noise echelle spec- 
tra of the 15 KBSS QSOs. A full description of the KBSS 
will be presented by Steidel et al. (in prep), but the portions 
of the data most relevant to this work (as well as a detailed 
description of the analysis of the QSO data) can be found in 
iRudie et al] (120121) . Here, we provide a brief description. 

The 15 KBSS QSOs were observed primarily with the High 
Resolution Echelle Spectrometer (HIRES; Vogt et al. 1994^ 
on the Keck I telescope. The HIRES spectra have Rc^ 45, 000 
(FWHM- 7 km s"'), S/N - 50-200 per pixel, and cover at 
least the wavelength range 3100 - 6000 A with no spectral 
gaps. The UV/blue wavelength coverage provides a signifi- 
cant advantage over other data sets as it enables the observa- 
tion of Ly/3 A1025.7 down to at least z = 2.2 in all 15 of our 
sightlines, and to significantly lower redshift in many.^ The 
additional constraints provided by Ly/3 (and in many cases, 
additional Lyman series transitions) allow for much more ac- 
curate measurements of H I for A^hi= 10'"^- 10'^ cm"^ than can 
be made with spectra including only the Lya transition. A full 
discussion of the redshift sensitivity of our fitting procedure is 
given in AppendixlAl 

The reduction of the HIRES spectra was performed us- 
ing T Barlow's MAKEE package as described in detail by 

^ The minimum wavelength! tliat can be used for line fitting is determined 
by both the wavelength coverage of the spectrum (set by the spectrograph 
and the atmosphere) and by the location of the highest-redshift Lyman Hmit 
system along each line of sight. The minimum redshift for which we perform 
line fitting is shown in Table[T] 
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KBSS Absorption Line Sample 












Name 


RA 


Dec 


ZQSo" 


'^min [A] 


z range 


Az 


AX'= 


Ng.," 


S/N Lya^ 


S/N Ly/?'' 


Q()100+130(PHL957) 


01:03:11.3 


+13:16:18.2 


2.721 


3133 


2.0617-2.6838 


0.62 


2.02 


5 


77 


50 


HS0105+1619 


01:08:06.4 


+16:35:50.0 


2.652 


3230 


2.1561-2.6153 


0.46 


1.50 





127 


89 


Q()142-()9 (UM673a) 


01:45:16.6 


-09:45:17.0 


2.743 


3097 


2.0260- 2.7060 


0.68 


2.21 


3 


71 


45 


Q0207-003 (UM402) 


02:09:50.7 


-00:05:06.5 


2.872 


3227 


2.1532-2.8339 


0.68 


2.26 


1 


82 


55 


Q0449-1645 


04:52:14.3 


-16:40:16.2 


2.684 


3151 


2.0792- 2.6470 


0.57 


1.84 


2 


73 


41 


Q0821+3107 


08:21:07.6 


+31:07:51.2 


2.616 


3239 


2.1650-2.5794 


0.41 


1.35 


2 


50 


33 


Q1009+29 (CSO 38) 


10:11:55.6 


+29:41:41.7 


2.652^ 


3186 


2.1132-2.6031 


0.49 


1.59 





99 


58 


SBS1217+499 


12:19:30.9 


+49:40:51.2 


2.704 


3098 


2.0273- 2.6669 


0.64 


2.07 


5 


68 


38 


HS 1442+2931 


14:44:53.7 


+29:19:05.6 


2.660 


3152 


2.0798- 2.6237 


0.54 


1.76 


6 


99 


47 


HS1549+1919 


15:51:52.5 


+19:11:04.3 


2.843 


3165 


2.0926- 2.8048 


0.71 


2.35 


4 


173 


74 


HS1603+3820 


16:04:55.4 


+38:12:01.8 


2.55 IS 


3181 


2.1087-2.5066 


0.40 


1.28 


2 


108 


58 


Q1623+268 (KP77) 


16:25:48.8 


+26:46:58.8 


2.535 


3126 


2.0544- 2.4999 


0.45 


174 


3 


48 


28 


HS 1700+64 


17:01:00.6 


+64:12:09.4 


2.751 


3138 


2.0668-2.7138 


0.65 


2.11 


4 


98 


42 


Q2206-199 


22:08:52.1 


-19:43:59.7 


2.573 


3084 


2.0133-2.5373 


0.52 


1.68 


3 


88 


46 


Q2343+125 


23:46:28.3 


+12:48:57.8 


2.573 


3160 


2.0884- 2.5373 


0.45 


1.45 


3 


71 


45 



' The redshift of the QSO I'Trainor & Steider2012') 

'' The minimum wavelength covered in the HIRES QSO spectrum 

'^ The comoving pathlength in this line of sight (equation[3). 

'' The number of galaxies in our LRIS survey with iinpact parameters < 300 pkpc. 

'^ The average signal to noise ratio per pixel of the QSO spectrum in the wavelength range pertaining to IGM Lya and Ly/3 absorption. 

' The redshift of this QSO was revised after the fitting of the HIRES spectrum was completed. The maximum redshift assumed zqso = 2.640 

s The redshift of this QSO was revised after the fitting of the HIRES spectrum was completed. The maximum redshift assumed zqso = 2.542 



iRudie et aP (l2012 h. The analysis presented here includes a 
Voigt profile fit to the full Lya and Ly/3 forests in these spec- 
tra. The redshift range included in the fit is bounded at high 
redshift by the proximity zone of the QSO (taken to be 3000 
km s"') in which absorbers may be ejected from or ionized by 
the QSO itself. The low -redshift cut was chosen to ensure that 
all H I absorbers included in the sample had been observed in 
both the Lya and Ly/3 transition. The final redshift windows 
used for each spectrum are detailed in Table [T] 

The line fitting was facilitated by a semi-automatic code 
that estimates input parameters (redshift z, H I column den- 
sity A^Hi, and Doppler parameter b^ of each ab sorber) us- 
ing a cross-correlation technique as described in iRudie et alJ 
(120121) . These parameters were then input into the x^ min- 
imization code VPFIT^ written by R.F. Carswell and J.K. 
Webb. The results were iteratively checked, altered, and re- 
run until a good fit was achieved. Median uncertainties in 
log(A^Hi) for these absorber measurements are listed in Table 
121 Examples of fits to absorbers with log(A^Hi/cm"^) > 15.5 
are given in AppendixlAlalong with an assessment of the com- 
pleteness of the catalog as a function of redshift. 

The result of this process is an H I absorber catalog includ- 
ing 5758 absorbers with 12.0 < log(A^Hi/cm"2) < 17.2 and 
2.02 < z < 2.84 over a total redshift pathlength of Az = 8.27. 
This represents the largest Lya forest catalog to date at these 
redshifts and increases by an order of magnitude the number 
of absorbers measured using the additional constraints pro- 
vided by higher-order Lyman series transitions. 

3. A'hi FREQUENCY DISTRIBUTION 

The differen t ial col umn density distribution, first defined by 
ICarswell et al.l 11984), is a useful way of parameterizing the 
frequency of absorbers as a function of A^hi- We consider the 
differential column density distribution per unit pathlength, 
f(Nm,X), defined as 



Table 2 

Median log(A'Hi) uncertainty 
for absorber measurements 



loglA'm/cm -) 


0-log(«„,) 


12.0-12.5 


0.10 


12.5-13.0 


0.07 


13.0-13.5 


0.07 


13.5-14.0 


0.06 


14.0-14.5 


0.04 


14.5-15.0 


0.03 


15.0-15.5 


0.03 


15.5-16.0 


0.04 


16.0-16.5 


0.07 


16.5-17.0 


0.06 


17.0-17.5 


0.09 



where m is the observed number of absorbers with column 
densities in the range AA^hi- ^X is the comoving pathlength 
of the survey summed over th e sightlines. This comov ing 
pathlength was introduced by iBahcall & PeeblesI (119691) in 
part to understand if the statistics of absorbers were consistent 
with having been produced by galaxies. With this formalism, 
absorbers with a constant physical size and comoving number 
density will have constant f{N,X). The comoving pathlength 
of a single sightline, AX,-, is defined as: 



dX=-^(l+zfdz 



H{z) 



AXi = 



dX = 



(l+zf 



y^nA + ^m(l+z)- 



(2) 



:dz (3) 



fiNHi,X)dNHidX = ■ 



m 



-dNmdX 



(1) 



AA^HiAX 

* http://www.ast.cam.ac.uk/~rfc/vpfit.html; © 2007 R.F. Carswell, J.K. 
Webb 



The total pathlength covered by this survey is AX = 26.9 and 
the Ax covered as a function of redshift summed over the 15 
lines of sight is shown in Figure [1] The pathlength-weighted 
mean redshift of the sample is (z) = 2.37. Measurements 
of f(N,X) derived from the KBSS sample of absorbers with 
12.5 < log(A^Hi/cm"^) < 17.2 are presented in Figure |2] The 
data points are located at the mean log(A^Hi) of the bin edges. 

3.1. Parameterizing the frequency distribution 
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Figure 2. The frequency distribution of absorbers from the KBSS H I sample (black data points). For bins with Nui > 13.5 cm ^, the number of 
absorbers per bin is displayed. The (red) line is a maximum likelihood power-law fit to the distribution of absorbers with 13.5 < log(A'Hi/cm"^) < 



17.2. The (green) triangle represents the frequency of LLS calculated from other studies as discussed in 33.31 The dark green band at the top 
of the triangle represents the error in this estimation. Note this is a single power law fit to the data. Section lSTI discusses broken-power law fits 
which better reproduce some aspects of the data, especially the incidence of LLSs. 



It has become standard to represent f{Niii,X) by a power 
law, or a series of broken power laws of the form: 



f(NHi,X)dNHidX = CmN^PdNnidX. 



(4) 

iTvtleri (Il987h first noted that the full column density distri- 
bution from 12 < log(A^Hi/cm"^) < 21 was reasonably well 
approximated by a single power law with f3 = 1.5. The de- 
tailed deviations from a single power law at vaiious A^hi 
and the change in normalization and slope of f{Niii,X) 
as a function of redshift have been considered by numer- 



ous authors (e.g.. ' Sargent et al.1 [19891 iPetitiean et al.l 1 1993 
Dave & Tripp 2001: iKim et alj 120021: iPenton et alj I2OO4I: 
Janknecht et al. 2006; Lehner et al. 2007; 'Pro chaska et al.l 
2010; Ribaudo et al. 201 1; O'Meara et al. 2012). 

The most notable deviations from a single power law ap- 
pear at the ends of the distribution. The Iow-A^hi end ap- 
pears to flatten, an effect generally attributed to a combination 
of incompleteness due to line blending and blanketing and 
a true physic al turnov er (Hu et al. 1995; Ellison et al. 1999; 
ISchaydl200H iKim et alJl2002i) . The deviation from a single 
power-law for log(A^Hi/cm"^) > 19 is believed to be due to the 
transition of gas from optically thin to self-shielding regimes 
jMurakami & Ikeuchi"1990^, 'Petitiean et al."1992'; 'Katz et 
1996; Corbelli et al. 2001; Zheng & Miralda-Escude 200 
Altavetal .112011 : McOuinn et al. 201 1). The most recent es 



timates of the frequency distribution of absorbers with 19 < 
log(A^Hi/cm-2) < 20.3 at z w 2 are bv lO'Meara eTaTI (l2007h : 
the frequency distribution of damped Lyman a systems with 
log(A^Hi/cm"^) > 20.3 is measured to relatively high preci- 



sion using the Sloan Dig ital Sky Survey (SDSS) QSOs by 
iNoterdaeme et al.l (l2009h . In this paper, we leverage the 
KBSS QSO absorption line data set to measure f{N,X) for 
absorbers with log(A^Hi/cm~^) < 17.2. 

3.2. Maximum Likelihood Method 

A statistically rigorous estimate of the power law exponent 
and normalization can be found via the Maximum Likelihood 
Method. The maximum likelihood estimator (MLE) for the 
power law exponent is: 



I3=l+n 



Vln^ 

J 'min 



!=1 



(5) 



where n is the total number of absorbers in the sample and 
A^min in the column density of the lowest-column absorber (see 
e.g.. IClauset et al.ll2007h . The standard eiTor on /3, derived 
from the width of the likelihood function, is: 



1 



•^r 



y/n 



Similarly, the estimator of the normalization, Chi, is 

n(l-/3) 



C, 



HI ■ 



WVmax 



■A^ 



(1-/3) 



EAX 



(6) 



(7) 



(lTvtleiJll987h where EAX is the sum of the pathlength over 
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Figure 3. The frequency distribution of absorbers from the KBSS H I sample (black points) and the maxim um likelihood fit t o the data (red 
curve). For comparison, over plotted in (blue) squares are the data points from the (z) =2.1 sample from iKim et al] ( l2002h . Because the 
analysis of the KBSS sample includes higher-order H I transitions, we can measure the frequency of absorbers with log(A?Hi/cm"^) > 14 with 
much higher fidelity than has been previously possible. Notably, there is no evidence for a strong break in the power law distribution near 
log(A'Hi/cm"~) ~ 14 as has been previously suggested. 





Table 3 




Maximum Likelihood Fits" 




log(A'HI,min) 


/9 


a^ lo 


g(CHl) 


12.5 


1.518 


0.008 


8.43 


12.6 


1.544 


0.008 


8.80 


12.7 


1.567 


0.009 


9.13 


12.8 


1.585 


0.010 


9.38 


12.9 


1.605 


0.011 


9.67 


13.0 


1.616 


0.012 


9.82 


13.1 


1.627 


0.013 


9.99 


13.2 


1.636 


0.014 


10.11 


13.3 


1.651 


0.015 


10.33 


13.4 


1.658 


0.016 


10.44 


13.5 


1.650 
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" Maximum likelihood fits to the data with 
log(/VHl,mm) < log(iVHi/cm-2) < 17.2. 

all 15 sightlines . 

In Figure |2] the MLE fit to the data is shown for absorbers 
with 13.5 < log(A^Hi/cm"^) < 17.2. Note that a single power 
law reproduces the distribution with reasonable fidelity. No 
strong breaks in the distribution are evident. 

The choice of a minimum A^hi= 13.5 used in the fit is 
motivated as follows. As shown in Figure |2] f(Niii,X) 
for log(A^Hi/cm~^) < 13.5 appears to flatten. As mentioned 
above, this flattening has previously been observed (,Hu et aL 



[1991 lEllisonetalJ[T999l : iKim et alJ [200l) and is generally 
attributed to a combination of incompleteness in the sam- 
ple caused by line blending and a true physical turnover 
in f(Nm,X). Here we do not attempt to diagnose the 
cause of the Iow-A^hi turnover, but simply note that above 
log(A^Hi/cm"^) > 13.5, f(N,X) appears to be well fit by a 
single power law. More quantitatively, one can measure the 
change in power law slope introduced by varying the mini- 
mum value of A^Hi included in the fit as shown in TablelH Note 
that a choice of log(A^Hi/cm"^) > 13.3 produces insignificant 

change in the maximum likelihood estimate of the slope, /3. 

Prior to the present work, the largest existing sample of 
Lyg fores t absorbers at these redshifts was described by 
iKim et aJl (|2002). Their sample included 2130 H I absorbers 
with 12.5 < log(A^Hi/cm"^) < 17 observed along 9 lines of 
sight. The most marked improvement over the previous mea- 
surements provided by our study is the use of higher-order 
transitions of H I which were generally not included in pre- 
vious work. These allow for more precise measurements of 
A^Hi above the saturation point of Lya, A^hi ~ lo'"*"'"*'^ cm"^. 
Above log(A^Hi/cm"^) > 15.5 where the Ly/3 transition satu- 
rates, the sample presented herein may suffer a small degree 
of incompleteness at z < 2.4 as discussed in AppendixlAl 

In Figure[3] the KBSS data (black diam onds) are compared 
with the measurements of :Kim et alJ (1200 2) at (z) =2.1 (blue 
squares). Notably, at Iog(A^Hi/cm~^) < 14 there is good agree- 
ment b etween the two d ata sets, but for log(A^Hi/cm"^) > 
14, the iKim et aP (120021) sample exhibits significant scatter 
not present in the KBSS measurements, with a systematic 
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tendency to under-estimate the number of intermediate-A^ni 
(14 < log(A^Hi/cm"^) < 17) absorbers compared to our mea- 
surements. Potential incompleteness found in the lower-z por- 
tion of the catalog presented herein (see Appendix |A) may 
also affect the iKim et alJ ( 1200 2) catalog for absorbers with 
log(A^Hi/cm~^) > 14 and may explain why their measurements 
are systematically lower than those presented here. 

3.3. Lyman Limit Systems 

In this section, we quantify the incidence of absorbers with 
A^Hi> 10'^'^ cm"^, Lyman Limit Systems (LLSs) which are 
optically thick to hydrogen-ionizing photons. Because LLSs 
are relatively rare, and because their A^hi are challenging to 
determine accurately via line fitting, we use samples reported 
in the literature to measure the frequency of optically thick 
absorbers at the same redshifts as the KBSS sample. 

We estimated the incidence of LLSs with log(A^Hi/cm"^) > 
17.2 at (z) ~ 2.4 using data from two surveys that nicely 
bracket the relevant range of redshifts for our IGM sam- 
ple. We evaluated the LLSs over a somewhat wider redshift 



range than 2 , 



, 2.8 (but having the same mean redshift) 



in order to mitigate the statistics of small numbers. Over 
the ran ge 1.65 < z < 2.55, we used data from Ribau do et al.l 
(120111) from their survey compilation of archival HST spec- 
troscopy; results for 2.70 < z < 2 .95 are based on a sub- 
set of t he ground-based surveys of 'Sargent et al.l (11989 ) and 
IStengle r-Larrea et al. (1995). The sub-samples are roughly 
equivalent in their statistical significance (comparable total 
pathlength and number of LLSs), with path-weighted mean 
redshifts of (z) = 2.08 and (z) = 2.84, respectively. The path- 
weighted mean redshift for the full sample is (zlls) = 2.41 
based on 46 LLSs with log(A^Hi/cm"^) > 17.2 in a total red- 
shift path of Az = 25.95 (AX = 88.31). These LLS samples 
provide a maximum likelihood estimate of the number of LLS 
per unit pathlength, dnu^^/dX = 0.52 ± 0.08. 

A recent study of LLS near the redshift range of inter- 
est (2.0 < Z < 2.6) u sing uniform HST data is described in 
lO'Meara et al.l (120121) . They measure an integral constraint on 
the number of LLS absorbers with Tqi2 > l,log(A^Hi/cm~^) > 
17.2 and T912 > 2,log(A^Hi/cm~^) > 17.5, where T912 is the 
optical depth at the Lyman Limit, 912A, in the absorber's 
rest frame. These authors found dnr>\/dX = QA%']}q^^^2^ in 
excellent agreement with our compilation described above 
{dtiu^^/dX = 0.52 ± 0.08). For more optically thick systems, 
lO'Meara et all dMl) found dnr>i/dX = 0.28!|J:{]^. The con- 
sistency of these measurements with the lower- A^hi frequency 
distribution will be discussed further in ^ 

Unfortunately, surveys that rely on detecting breaks in the 
spectrum of QSOs at the redshift of the LLS can rarely mea- 
sure A^Hi accurately, especially for absorbers with T912 > 2. 
As such, a measurement of the power law index of f{N,X) 
in the LLS regime typically relies on the combination of in- 
tegral constraints (dn/dX) for the LLS, subDLA, and DLA 
regimes, as well as the measured values of f(N,X) for Lya 
forest absorbers. As we now have much more statistically 
robust measurements of /3 for Lya forest absorbers with 
log(A^Hi/cm"^) < 17.2, we re-examine the power law fit to 
f(N,X) in the LLS regime. 

Here, we test the consistency of the measured incidence 
of LLSs with the extrapolated slope inferred from the MLE 
method for absorbers with log(A^Hi/cm"^) < 17.2. First, 
we note that LLS surveys are sensitive to all absorbers 



with log(A^Hi/cm^) > 17.2. Observations of DLAs (e.g., 
iNoterdaeme et al.ll2009l) show that there is a strong turnover 
in the frequency of DLAs at log(A^Hi/cm"^) > 21, suggest- 
ing there are very few absorbers with log(A^Hi/cm"^) > 22. 
Here, we assume that dnuis/dX is sensitive to absorbers with 
17.2 < log(A^Hi/cm-2) > 22.0. 

Next, we consider the incidence of LLSs that would be 
found if the MLE power law fit constrained by our data at 
log(A^Hi/cm"^) < 17.2 held for the entire LLS regime. We 
calculate the incidence of LLSs given the fit parameters as 
follows: 



dni^Ls/dX = 



dni^i^SM/dX -- 



f{NHuX)dNHI 



'N,„„ 



CmN^^dNm- 



(8) 
(9) 



where log(Nmm/cm~^) = 17.2 and log(Nm^/cm~^) = 22.0. We 
find that this extrapolation to log(A^Hi/cm"^) > 17.2 underesti- 
mates the number of LLSs significantly giving drti^i^sM/dX = 
0.21 (compared to t/nLLS.data/fi^ = 0.52 ±0.08). 

The green triangle in Figure |2]represents a similar exercise 
graphically on the f(N,X) plot. If we assume that the MLE 
determination of f3 for the forest holds in the LLS regime, 
we can calculate the value of Chi.lls implied by the measured 
dnu^^/dX by inverting equation|9] Then the values of f{N,X) 
suggested by the data would be given by: 



f(Np,X) = Cm.LLsN, 



-P 



(10) 



for all values of A'p. The height of the green triangle thus rep- 
resents the values of f{N,X) if /9mle holds of the LLS regime. 
The dark green band at the top of the triangle is an estimate in 
the uncertainty of this normalization due to the uncertainty in 
dnu^^/dX. Again, we can see clearly that the MLE fit to the 
frequency distribution (red dotted line) significantly under- 
predicts the measured LLS incidence. 

We have seen that a single power law well approximates 
the intermediate-A^Hi absorbers; however, its extrapolation to 
log(A^Hi/cm"^) > 17.2 underestimates the number of LLSs 
significantly giving dnu^^f^ij dX = 0.21. In ^ we discuss 
other methods for parameterizing the distribution which better 
reproduce the high-A^Hi end of the frequency distribution. 

4. FREQUENCY DISTRIBUTIONS WITH RESPECT TO THE 
POSITIONS OF GALAXIES 



iRudie et aLJ ( 120 12h considered the distribution of H I with 
respect to the positions of star-forming galaxies with 2.0 < 
z < 2.8 using the absorption line data presented here. The 
study explored the kinematic and geometric structure of the 
CGM, concluding that the majority of the strong excess ab- 
sorption is found within 300 physical kpc (pkpc) of galaxies in 
the transverse direction and within 300 km s"' of the systemic 
redshift of galaxies; however, a tail of excess H I absorbers 
continues to 2 pMpc and 700 km s"' . 

The KBSS spectroscopic galaxy sample consist of UV- 
colo r selecte d galaxies with UV luminosities 0.25 < L/L* < 
3.0 dReddv & Steidel 200^ ) and a typical bo l ometri c lumi- 
nosity of - 2.5 X 10"Lo dReddv et all 12008 1. 2012). They 
have star formation rates of ~ 30 Mq yr~' ( fErb et al..i2006l) 
and halo masses of ^ lO'^ M(^ (Adelbe rgeret al.l 120051: 
IConrovet all 120081: lTrainor& St"eidelii2012i O. Rakic et al. 
2012 in preparation). 
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Figure 4. Same as Figure |2]but also considering tlie distribution of absorbers close to galaxies. The IGM distribution is shown in grey. Red 
points show the frequency of absorbers of a given A'hi within 300 km s"' and 300 pkpc of a galaxy in our spectroscopic sample. Blue points 
show the frequency of those within 700 km s"' and 300 pkpc. The curves are the maximum likelihood fits to the distributions over the range 
13.5 < log(A'Hi/cm"^) < 17.2. The parameters of the fit are given in Table|4]along with the opacities calculated from the data and the opacities 
implied by the fit. Notably this single power law fit to the data does not well reproduce the opacities calculated from the data. Section [STT] 
discusses broken-power law fits which better reproduce these opacities. 

300 km s"' CGM than within 700 km s~\ as expected.^ 

The characteristic A^hi of the CGM is akin to the 
max(A^Hi ) statistic presente d in the left-hand panel of Fig- 
ure 9 of iRudie et al.l (l2012h . These authors typically found 
logCA^Hi/cm"^) > 14 .5 - 15 within 300 pkpc of a galaxy. 
iMisawa et al.l (120071) considered the frequency distribution of 
absorbers surrounding H I systems with log(A^Hi/cm"^) > 15, 
fitting to the absorber distribution within ± 1 00 km s~' of the 
redshifts of these tracer absorbers. Thus, the iMisawa et al.l 



Usin g the same KBSS galaxy sample as in iRudie et all 
(120121) . we measure the frequency distribution of H I ab- 
sorbers within the CGM as follows. First we locate those 
galaxies in our spectroscopic sample lying within 300 pkpc of 
the line of sight to the QSO and within the redshift range used 
for the Lya forest decompositions (as listed in Table [TJ. No- 
tably, we have no galaxies at impact parameters < 50 pkpc, 
and so the CGM distributions apply to the range 50 - 300 
pkpc. We then consider the column density distribution of 
absorbers within ±300 km s~' and ±700 km s"' of the red- 
shifts of these galaxies. We note that all absorbers included 
in the 300 km s"' CGM are also included in the 700 km s"' 
CGM. 

The resulting frequency distributions for the 300 km s"' and 
700 km s"' CGM are s hown in Figure |4| i n red and blue re- 
spectively. As found by IRudie et alJ (120121) . the frequency of 
all absorbers with log(A^Hi/cm~^) > 13.5 is higher within the 
CGM of galaxies than at random places in the IGM. 

The maximum likelihood estimates of the power law ap- 
proximation to f(Niii,X) within the CGM are listed in Table 
m and overplotted in Figure |4] Notably, the power law slope 
of the CGM is significantly shallower than that of the IGM, 
in the sense that there are relatively more high-A^Hi systems 
than low-A^Hi absorbers in the CGM. Th is result is qualita- 
tively equivalent to results presented by IRudie et aH (120 12h 
who found a strong coiTelation between A^hi and the fraction 
of absorbers found within the CGM of galaxies (their Figure 
30). Further, the frequency of absorbers is higher within the 



(i2007i) sample traces a volume similar to that of the CGM. 
For their S2a sample which was selected to lie at least 5000 
km s~ ' lower than the redshifts of the QSOs, IMisawa et aLJ 
(120071) find a power-law slope of /3 = 1 .390 ± 0.027, in reason- 
able agreement with the power-law exponents for our CGM 
measurements. The normalization, Chi = 7.262 ±0.431, is 
lower than our CGM results, consistent with the expectation 
that the lMisawa et al.i (2007) sample includes a larger fraction 
of "general" IGM compared to CGM. 

5. THE LYMAN CONTINUUM OPACITY 

One of the principal uses of the column density distribution 
presented above is to estimate the transmissivity of the IGM 

^ Since the majority of the excess absorption is found within 300 km s"' of 
galaxies, the 700 km s"' CGM is diluted by regions with H I properties more 
similar to the general IGM. As such, the difference between the 300 and 700 
km s"' CGM is mostly a result of the difference in pathlength considered. 
The pathlength for the 300 km s"' CGM sample is AX = 0.9 while the 700 
km s"' sample has twice the pathlength. AX = 1.9 - and since f(Nni,X) is 
inversely proportional to the pathlength, the frequency of absorbers in the 700 
km s"' CGM is approximately a factor of 2 smaller 
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Table 4 

MLE Power Law fits to f(Nni,X) 



Sample 



/3 



log(CHi) Kflt [pMpc 



Kdala" [pMpC 



IGM 1.650 ±0.017 10.322 0.0029 

CGM 700km S-' 300 pkpc 1.447±0.033 7.672 0.013 

CGM 300km S-' 300 pkpc 1.425 ±0.037 7.562 0.022 



0039+"'""'^ 

()17+O.004 
"■"^ '-0.004 



0.031+ 



' Opacity calculated from the the data using equation [Tsl for 13.5 < log(A'Hi/cm ^) < 17.2 
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Figure 5. Contributions to the opacity from a given bin in A^m, measured directly from the data using equation [15] Left: The IGM opacity per 
bin in A'hi- Right: The opacity per bin in Nm within the IGM (green), the 700 km s"' CGM (blue) and the 300 km s"' CGM (red). Note that the 
scale of the left-hand panel is 10% of the scale of the right-hand panel. The opacity (optical depth per unit physical distance) of the CGM is an 
order of magnitude larger than the average opacity in the IGM. 



to hydrogen-ionizing photons, hereafter referred to as Lyman 
Continuum (LyC) photons. Understanding LyC emission and 
its transmission through the IGM is an important step in ex- 
ploring the process of reionization at high-redshift, as well 
as estimating the average ionization level of the IGM and the 
metagalactic ionizing background as a function of redshift. 

In order to understand and describe the process of transmis- 
sion of LyC photons through the IGM, one must first quantify 
the sources of opacity in the IGM. Most notably, it is impor- 
tant to quantify the fractional contribution of absorbers of var- 
ious A^Hi to the full opacity of the forest. To do this, we begin 
with a simplified model of LyC emission and absorption. 

The attenuation of LyC photons by an absorber with A^hi= 
A^, is given by e~^'°' where a is the photoionization cross sec- 
tion of hydrogen. In practice, a depends sensitively on the 
energy of the photons considered: 



suming that the universe is static.*^ In practice, the expansion 
of the universe results in the redshifting of ionizing photons 
to non-ionizing energies; however since we are only consid- 
ering 1 Ry photons, we will also temporarily ignore redshift- 
ing. In ^ we perform a more rigorous calculation to measure 
the true mean free path of LyC photons through the IGM and 
CGM at z=2.4 including photons of all ionizing energies and 
including redshifting. The analytic calculation presented in 
this section is provided to build intuition regarding the sources 
of opacity in the universe, and also to quantify the opacity 
from absorbers with Iog(A^Hi/cm"^) < 17.2 inferred directly 
from the data. 

Using the above approximations, we calculate the opacity, 
K, of the IGM and CGM to LyC photons and the contribution 
to this opacity from absorbers of various A^hi- The opacity is 
defined as the change in optical depth per unit proper distance: 



cr(i^) w (Tll 



(^) 



(11) 



K = 



dr 
dr 



where 



where ctll = 6.35 x 10 '^ cm^ is the hydrogen ionization cross 
section to photons at the Lyman Limit (912 A) with energies 
of 1 Ry (13.6 eV), and vll is the frequency of such photons. 
For the moment, we will employ the simplifying assumption 
that all LyC photons have energies of 1 Ry so that the fre- 
quency dependence of a is removed. 
In addition, for the time being, we calculate the opacity as- 



dr= dz 

l+zH(z) 



(12) 



(13) 



and c is the speed of light. 

^ For the remainder of Section [5] we use the redshift distribution of the 
sample to infer the physical distance over which absorbers are distributed. 
This is the only way in which redshifts enter into the calculation. 
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Given an analytic fit to the frequency distribution per 
unit physical distance f(Niii,r), the opacity due to Poisson- 
distributed absorbers with A^^in < A^hi < A'max to photons of 
energy 13.6 eV is: 

i^m= / /(A^Hi,'-)(l-e-^"-""Oa'A^Hi (14) 



dParesce et al.llT980l) . 

Similarly, the opacity can be calculated directly from the 
data by summing over those absorbers {i} with A'min < A^hi < 

^ * max • 

X;,(l-e"^""'""-) 



'i^data ■ 



Ar 



(15) 



where Ar is the sum of the proper distances probed by each 
sightline. 

The uncertainty in the measurement of Kdata can be calcu- 
lated using the bootstrap resampling method in which many 
sets of absorbers are drawn with replacement and Kjata is re- 
calculated for each set. One can then estimate the proba- 
bility distribution of Kdata by selecting the 68% confidence 
range. Values of Kdata for the IGM and CGM for 13.5 < 
log(A^Hi/cm"^) < 17.2 is listed in the last column of Table|4] 
We emphasize that this measurement of the LyC opacity from 
absorbers with log(A^Hi/cm~^) < 17 is one of the principal re- 
sults of this work. 

The left hand panel of Figure|5]shows the opacity produced 
by various bins of log(A^Hi/cm"^) < 17.2 within the IGM cal- 
culated directly from the data using equation [15] These val- 
ues are compared to the opacity found within the CGM in the 
right-hand panel of Figure|5] Notably, the opacity of the CGM 
of a galaxy (300 km s~', 300 pkpc) is an order of magnitude 
larger than that of the average IGM. 

Comparing the opacities calculated using the single power 
law MLE fit with those calculated directly from the data for 
absorbers with log(A^Hi/cm~^) < 17.2 (Table |4|l, one notes a 
statistically significant discrepancy in the sense that the MLE 
power law predicts far less opacity than is measured directly 
from the data. Similarly, the number of LLSs (which also 
contribute significantly to the LyC opacity) is under-predicted 
by the MLE fit. In Section 15.11 we consider other param- 
eterizations of the frequency distribution that better match 
dn^Ls/dX as well as the opacity from intermediate-A^Hi sys- 
tems [15 < log(A^Hi/cm"^) < 17] calculated from the data. 

5.1. Broken Power Law Parameterizations 

While the MLE power law fits presented in ^3.2l and ^pro- 
vide a reasonable description of the overall column density 
distribution, because there are substantially more absorbers at 
low-A^Hi the majority of the statistical power for the fit is de- 
rived from the Iow-A^hi absorbers. To estimate the LyC opac- 
ity of the intergalactic and circumgalactic gas, the absorbers 
of relevance are instead the much rarer, high-A^Hi systems, par- 
ticularly those near log(A^Hi/cm"^) w 17. 

In order to reproduce simultaneously the opacity from both 
absorbers with 12.5 < log(A^Hi/cm~^) < 17.2 as measured in 
our sample, as well as the number of LLSs, we adopt a dif- 
ferent fitting method. Here we consider a least-squares fit to 
the binned'' data. Because the opacity is a strong function of 
A^Hi, it is most important that the number of absorbers with 

' We have verified that the fitted parameters are insensitive to the exact 
choice of binning. 



log(A^Hi/cm"^) w 17.2 is accurately reproduced. Therefore, to 
provide further flexibility, we allow for a discontinuous break 
in the power law parameterization. This allows for an inde- 
pendent fit to the high-A^Hi absorbers [log(A^Hi/cm"^) > 15] 
which are the main source of opacity. Although no strong 
break is evident, any parameterization that does not include a 
break as well as all parameterizations with a continuous func- 
tional form at the position of the break cannot reproduce the 
opacity of Lya forest absorbers as measured by the data. 

We fit two power laws to the binned data. The bin that 
includes the break is included in the fit to the low and 
intermediate-A^Hi groups to minimize the discontinuity be- 
tween the two power laws. We allow the break to vary in 
A^Hi and select the break location to be that which best repro- 
duces the opacity from absorbers with log(A^Hi/cm~^) < 17.2 
and also the number of LLS in the case of the IGM sample. 
The break location for the two CGM distributions is forced 
to match. The best fits are presented in Figure |6] and Table |5] 
along with their implied opacities, Kf^. The bin containing the 
break is indicated by the hatched box in Figure |6] Notably, 
the location of the break (A^hi ~ 10'^ cm"^) is similar to the 
A^Hi at which absorbers were found to significantly correlate 
with the positions of galaxies (.Rudie et al...201Z) . 

This fitting technique reproduces the opacity from 
intermediate-A^Hi absorbers measured from the data with high 
fidelity. Additionally, the IGM fit predicts a dnu^^,ftt/dX = 
0.519 for 17.2 < log(A^Hi/cm"2) < 22.0 in exceflent agree- 
ment with dnu^si^tj dX = 0.52 ± 0.08 determined from the 
compilati on of LLSs from the literature (Section 13. 3t . '" As 
in section [331 the green triangle in Figure [6] represents the 
values of f{N,X) within the LLS regime implied by the 
measurement of dni^i^s.&dt^/dX assuming a power law index, 
P = Aigh-NHi ■ The agreement between the height on the green 
triangle and the extrapolation of the power law fit (grey line) 
suggests that the broken power law parameterization well re- 
produces the LLS gross statistics. 

Next we consider the uncertainty in the fitted parameters. 
The majority of the uncertainty in the normalization of the 
frequency distribution (Cnhi) is due to its covariance with the 
power law index, /3. In Table [6] we list the best fit deter- 
mination of /?high-NHi and Chigh-Nm and the values of k^ and 
dni^i^5,fy/dX implied by this analytic approximation. Also 
listed are the l-cr perturbed values of /3high-Nnn the corre- 
sponding normalization Chigh-NHP and their implied k^ and 
dnu^s.fit/dX. Notably, the perturbed analytic parameters re- 
produce the opacity due to Lya forest absorbers within the 
measurement errors. The incidence of LLSs is more sensitive 
to perturbed values of /? with offsets in the implied incidence 
of ^ 1.5cr from the observed incidence. As the incidence of 
LLSs is not used to constrain the power-law fit, the disagree- 
ment between these values is not surprising. 

Figure[7]compares the IGM broken power la w fit to the fre- 
quency of LLS with rcii 2 > 1 and t^i2>2 from lO'Meara et al] 
( I2OI2I) as described in ^3.31 The blue triangle corresponds to 
T<)\2 > 1 absorbers, and the r ed triangle represents the rg^ > 2 
absorbers. As in section [373] the normalization of the triangles 
are set by the measured value of dni^i^s^-^t^/dX under the as- 
sumption that an extrapolation of the broken power law fit 
to the Lya forest data holds within the LLS regime. Again, 
the agreement between the extrapolated power-law fit (grey 

'" We remind the reader that the single power law MLE determination of 
the fit predicted a value of dnu^^f^/dX = 0.21. 
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Figure 6. Broken power law least-squares fits to the binned IGM (grey) and CGM (blue and red) data. The slopes and normalizations for each 
data set (IGM or CGM) are allowed to vary independently; however, both ranges include the data point centered at the break (shaded regions). 
Note that the broken power-law fit is discontinuous at the break location. Including one data point in both ranges minimizes the discontinuity 
associated with the break and produces a better fit to the data. The hatched box sho ws th e range of A'hi included in that point and thus in both 
fits. The green triangle represents the measured incidence of LLSs as discussed in 93.31 The parameters of the fit are given in Table[5]along 
with the opacity calculated from the data and the opacity produced by the fit to the distribution. Note that the opacity is reproduced to much 
higher fidelity using the broken power law fit. 



Table 5 
Broken Power Law fits to f(Nni,X) 



Sample 



Plow-Nu 



log(Clow-NH, ) ^^^^ ^HI 



Aigh-NH 



log(Chigh_NH|) Kflt [pMpc 



Kdata" [pMpC 



IGM: 1.656± 0.030 

CGM 700km s"' 300 pkpc 1.353± 0.088 
CGM 300km s"' 300 pkpc 1.302± 0.084 



10.398 


15.14 


1.479± 0.042 


7.643 


0.0038 


n039+<''""'5 


6.354 


14.87 


1.385± 0.084 


6.789 


0.017 


(1 fi 17+0.004 
"■^^ '-0.004 


5.823 


14.87 


1.345± 0.089 


6.394 


0.031 


nnTi+O.OOS 
"■"^'-0.009 



' Opacity calculated from the the data using equation [T5]for 13.5 < log(Nni/cra ^) < 17.2. 



curve) and the blue and red triangles suggests that the ana- 
lytic approximation to the frequency di stributio n reproduces 
the measure LLS statistics from O'Meara et al.l (12012) . The 
green point in Figure Q shows the measured frequency of ab- 
sorbers with 1 < T912 < 2 [17.2 < log(A^Hi/cm"^) < 17.5] cal- 
culated from the difference between the measured values of 
dni^i^s^r>i/dX and dnu^^^ryi/dX. 

The power law fit to f{N,X) suggested bv lO'Mearaetal.1 
(120 12h is overplotted as the red dotted line for comparison. 
This curve would imply a value for the opacity from absorbers 
with log(A^Hi/cm"^) < 17.2 of Kfit = 0.0022, nearly a factor 
of two smaller than that measured from the KBSS sample 
(Kdata = 0.0039 ±0.0005). 

There are likely some deviations in the detailed distribution 



of f{Nm,X) in the subDLA (19.3 < log(A^Hi/cm-2) < 20.3) 
and DLA (log(JVHi/cm ~^) > 20.3) regimes as d iscussed in 
iProchaska etal] (120091) and lO'Mearaet al.l (120121) . We show 
the current constraints on these regions of the frequency dis- 
tribution in the right hand panel of Figure |7] We chose to 
neglect the differences between the power law fit and ob- 
served constraints on f{N,X). Holding the measured value 
of dni^Ls/dX fixed, such an assumption makes our measure- 
ments of the opacity from absorbers with A'hi > 10'^^ cm"^ 
a lower limit. However, since the attenuation resulting from 
any system with log(A'Hi/cm"^) > 18 is similar, the detailed 
shape of the distribution for high-A^ei will introduce relatively 
small changes in the opacity so long as the total number of 
such systems is reproduced. 
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Table 6 

Effect of uncertainty in /3 on k and d«LLS MX''^ 




Sample 


/3high-NHi 


log(Chigh-NH,)'' 


K[pMpc-'r 


dnu.s,/dX-^ 


Data= 

IGM Best Fit 
IGM Shallow Fitf 
IGM Steep Fit^ 


1.479 
1.437 
1.521 


7.643 
6.984 
8.301 


0039+*"""'5 

0.0038 
0.0040 
0.0035 


0.52 ±0.08 

0.519 
0.656 
0.413 



" The effect of varying /3high-NHi within its l-c uncertainty. 
^ The best fit normalization of f{N,X) given the stated value of /3. I.e., the co- 
variance of log(Chigh-NH,) *'* Aligh-NHi- 
' The opacity from absorbers with logCWm/cm"-) < 17.2. 
^ The incidence of LLSs 
' Values computed directly from the data. 

f The fit for the perturbed value of /Shigh-Nm such that the distribution is more 
shallow. 
8 The fit for the perturbed value of Aiigh-Nni ^^ch that the distribution is steeper. 
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Figure 7. Same at Fi gure [6] but only considering the IGM distribution. Left: The results of a recent survey for Lyman limit absorbers by 
lO'Meara et al.l | |2012|) . Note that these authors' constraints on r9i2 > 1 and Tgn > 2 LLS are well reproduced by the extrapolation of the power 
law slope selected for the high-Wm IGM absorbers. Computing the difference between these values allows for an assessment of the number of 
LLSs with 1 < Tgi2 < 2, here represented as the green data point. The (red) dotted curve shows the fit to the frequency distribution proposed by 
lO'Meara et alj ( l2()12l) . Note that this fit underpredicts the number of absorbers with log(A'Hi)/cm"^) < 17.2 as measured with the KBSS sample 
(black data points). Right: Including all currently known constraints at these redshifts for f(N,X). In green the 1 < Tgn < 2 measurement 
from lO'Meara et all ( i20l2h . In blue are the subDLAs (19.3 < log(A^Hi/cm^ ^) < 20.3; dn,ubDLA/dX = OMO^xm) from lO'Meara et al.l ( i2007h 
and in red are the DLAs (log(A'Hi/cm"^) > 20.3; dnoLA/dX « 0.05) from Noterdae me et"aLl ( 120091) . We note that the extrapolation of the 
intermediate-WHi absorbers power law fit suggested by our Lya forest data appears to provide a good fit to the data for log(A'Hi/cm"") < 18. 
Above this A'm, nature likely deviates significantly from this power law. 



The opacity inferred from the power law fit to f(N,X) is 
a lower limit for the following reason. If we hold diiu^s/dX 
fixed at the observed value, this gives us a constraint on the 
number of systems with A^hi > lO'^'^ cm"^. The assumption 
of a given value of the power-law index, /3, allows us to in- 
fer the opacity from a measurement of diiu^s/dX. Allow- 
ing /3 to take a shallower value to match the plotted points of 
f{N,X) for the subDLAs and DLAs only moves absorbers to 
higher A^hi and thus higher opacity compared to a model with 
a steeper power-law index. Therefore, our suggested opaci- 
ties are likely a slight under-prediction of the true value. If 
we instead choose a flat value {(3 = 0) we would find 9.7% 
higher opacity from LLSs, suggesting the total uncertainty in 



the opacity from the assumed value of f3 is significantly less 
than 10%. 



5.2. The Opacity of the IGM and CGM 

Under the assumptions outlined above, and with a param- 
eterization of the frequency distribution that reproduces both 
the incidence of LLS and the opacity from intermediate-A^ei 
absorbers, we can now measure the full opacity of the IGM 
and CGM. We calculate the opacity from LLS absorbers with 
log(A^Hi/cm"^) > 17.2 using equation I14f' and the opacity 

' ' Note that the detailed distribution of k per bin in Nm would be affected 
by variation in f(Nm) at DLA and subDLA column densities. Because we 
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Table 7 

Opacity and Optical Depth of Absorbers of various Whi 



Nyii min A'hi max method 



KiGM TlGM ftCGM 700 fcGM 700 

xlO' [pMpc-'] xlOO xlO' [pMpc-'] xlOO 
Differential: A'hi min < log(A'Hi/cm"^) <A'hi max 



I^IGM TlGM I^CGM 700 TcGM 700 

xlO' [pMpc-'] XlOO xlO'lpMpc-'] XlOO 
Cumulative: 12.55 < logiNm/cmr-) <A'hi max 



12.55 


12.60 


data 


0.001 


0.02 


0.001 


0.00 


0.001 


0.02 


0.001 


0.00 


12.60 


12.65 


data 


0.002 


0.02 


0.002 


0.00 


0.003 


0.03 


0.002 


0.00 


12.65 


12.71 


data 


0.002 


0.02 


0.002 


0.00 


0.005 


0.06 


0.004 


0.00 


12.71 


12.77 


data 


0.003 


0.03 


0.003 


0.00 


0.008 


0.09 


0.007 


0.00 


12.77 


12.85 


data 


0.004 


0.04 


0.003 


0.00 


0.012 


0.13 


0.009 


0.00 


12.85 


12.93 


data 


0.004 


0.04 


0.003 


0.00 


0.016 


0.17 


0.012 


0.00 


12.93 


13.01 


data 


0.006 


0.07 


0.006 


0.00 


0.022 


0.24 


0.018 


0.01 


13.01 


13.11 


data 


0.008 


0.08 


0.007 


0.00 


0.030 


0.32 


0.024 


0.01 


13.11 


13.21 


data 


0.009 


0.10 


0.010 


0.00 


0.039 


0.42 


0.034 


0.01 


13.21 


13.32 


data 


0.011 


0.12 


0.011 


0.00 


0.050 


0.54 


0.046 


0.01 


13.32 


13.45 


data 


0.017 


0.18 


0.016 


0.00 


0.067 


0.72 


0.061 


0.02 


13.45 


13.59 


data 


0.021 


0.22 


0.022 


0.01 


0.088 


0.94 


0.083 


0.02 


13.59 


13.74 


data 


0.024 


0.25 


0.032 


0.01 


0.112 


1.20 


0.116 


0.03 


13.74 


13.90 


data 


0.027 


0.29 


0.027 


0.01 


0.140 


1.49 


0.143 


0.04 


13.90 


14.08 


data 


0.038 


0.40 


0.053 


0.02 


0.178 


1.89 


0.195 


0.06 


14.08 


14.28 


data 


0.045 


0.48 


0.089 


0.03 


0.223 


2.37 


0.285 


0.08 


14.28 


14.50 


data 


0.066 


0.70 


0.073 


0.02 


0.289 


3.07 


0.358 


0.10 


14.50 


14.73 


data 


0.095 


1.01 


0.198 


0.06 


0.383 


4.08 


0.557 


0.16 


14.73 


15.00 


data 


0.115 


1.23 


0.353 


0.10 


0.499 


5.31 


0.910 


0.26 


15.00 


15.28 


data 


0.123 


1.30 


0.351 


0.10 


0.621 


6.61 


1.261 


0.36 


15.28 


15.60 


data 


0.202 


2.14 


0.511 


0.15 


0.823 


8.75 


1.772 


0.51 


15.60 


15.94 


data 


0.314 


3.34 


1.320 


0.38 


1.136 


12.09 


3.092 


0.89 


15.94 


16.32 


data 


0.471 


5.01 


1.316 


0.38 


1.607 


17.10 


4.408 


1.26 


16.32 


16.73 


data 


0.661 


7.03 


4.278 


1.23 


2.268 


24.13 


8.686 


2.49 


16.73 


17.20 


data 


1.695 


18.03 


8.571 


2.46 


3.963 


42.17 


17.257 


4.95 


17.20 


17.70 


fit 


1.570 


16.70 


9.249 


2.65 


5.534 


58.87 


26.506 


7.60 


17.70 


18.20 


fit 


1.111 


11.82 


7.273 


2.09 


6.645 


70.69 


33.779 


9.68 


18.20 


18.70 


fit 


0.647 


6.88 


4.719 


1.35 


7.291 


77.57 


38.498 


11.04 


18.70 


19.20 


fit 


0.372 


3.96 


3.030 


0.87 


7.664 


81.53 


41.527 


11.91 


19.20 


19.70 


fit 


0.214 


2.28 


1.946 


0.56 


7.878 


83.81 


43.473 


12.46 


19.70 


20.30 


fit 


0.141 


1.50 


1.439 


0.41 


8.019 


85.31 


44.912 


12.88 


20.30 


22.00 


fit 


0.127 


1.35 


1.597 


0.46 


8.146 


86.67 


46.509 


13.33 



' Opacity calculated from the the data using equation [T5]for 12.55 < logCWm/cm ~) < 17.2. 
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from intermediate-A^Hi absorbers with log(A^Hi/cm"^) < 17.2 
directly from the data using equation [15] (as described in the 
beginning of Section |5]l. The differential opacity from bins 
in A^Hi as well as the cumulative opacity (from absorbers with 
A^Hi smaller than a given value), are given in Table|7] The total 
opacity of the IGM is measured to be: 



KIGM = Kdata (A^HI < lO'^'^) + Kfit (A^HI > lO' 

KiGM = 8.146x IQ-^pMpc-'. 



Similarly, 



KCGM 300 km s-i =94.4 X 10 pMpC 
«^CGM 700 km s-' =46.5 X 10"^ pMpc" 



(16) 
(17) 

(18) 
(19) 
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Figure 8. The optical depth of the CGM tol3.6 eV photons. The bins 
with log(A'Hi/cm"^) < 17.2 (solid shading) are measured directly 
from the data. The bins corresponding to LLSs (log(A'Hi/cm"^) > 
17.2; hatched shadding) use the broken power law fit to the higher 
A'hi end of f(Nni,X) as shown in Figure |6] The blue and red his- 
tograms show the total optical depth within one average CGM. This 
is equivalent to the opacity times the scale length ArcGM- Note that 
the total optical depth in the 700 km s"' CGM includes the opacity 
within the 300 km s"' CGM but that there exists additional opac- 
ity due to absorbers with velocity offsets between ±300 and ±700 
km s"'. 



Another relevant quantity is the integrated opacity over a 
specific scale length, i.e., the optical depth r. In particular, 
it is now possible to calculate the optical depth of the CGM 
of an individual galaxy. This quantity is particularly relevant 
for understanding the attenuation of ionizing flux emanating 
from a galaxies similar to those in the KBSS sample due to 
the COM of the galaxy itself. Here we attempt to separate the 
probability that ionizing photons escape the ISM of a galaxy 
from the probability that photons, having already escaped the 
ISM, will also escape the enhanced absorption found within 
the CGM. 

adopt the extrapolation of the broken-power law fit to f{N,X) to calculate the 
opacity, the values in individual bins of Whi for log(A'Hi/cm"^) > 17.2 are 
not well constrained. 
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Specifically, the KBSS data set constrains the opacity and 
optical depth of the CGM of galaxies at impact parameters 
greater than 50 kpc'^. Henceforth, we consider the classical 
"escape fraction" to be the probability that an ionizing pho- 
ton reaches a distance of 50 pkpc, while the CGM is defined 
to begin at a distance of 50 pkpc from the galaxy. In order to 
calculate the optical depth, the scale length of interest must be 
assumed. For circumgalactic absorption, the velocity window 
associated with the CGM provides a natural scale length. As 
the relevant scale is along the line of sight, we use the dis- 
tance, ArcGM, associated with the line-of-sight velocity am- 
plitude of the CGM: 



ArcGM =H-\z)dw 



(20) 



where H(z) is the Hubble constant evaluated at the mean red- 
shift of the sample (z) = 2.4 and dv is either 700 km s"' or 300 
km s~^ We note that the KBSS data set constrains the distri- 
bution of gas at transverse distances > 50 pkpc from galaxies, 
not along the line of sight from galaxies. Here we assume that 
the gas distribution traced in the transverse plane well repre- 
sents the gas distribution along the line of sight to a galaxy at 
distances > 50 kpc. Given the strong in crease in A ^ht at small 
galactocentric distances [as observed in iRudie eFal.l (|20 121) 1. 
the optical depth associated with the CGM at distances > 50 
kpc is likely significantly smaller than the optical depth asso- 
ciated with the closer-in CGM. Therefore the measurements 
presented here might be thought of as a lower Umit on CGM 
absorption. 

In Figure [S] we compare the optical depth of the CGM 
(tcgm = 'tCGiviArcGM) for the 300 km s"\ 300 pkpc CGM 
(red) and the 700 km s"', 300 pkpc CGM (blue). Figure H] 
also quantifies the optical depths from both forest absorbers 
(log(A^Hi/cm-2) < 17.2) and LLS in the CGM. The forest ab- 
sorber opacities and optical depths are measured directly from 
the data, while the LLS opacity and optical depths are inferred 
from an extrapolation of the high-A^ei fit to the data using the 
broken power law parameterization. The values for tcgm and 
'tCGM in differential and cumulative A^hi bins are given in Ta- 
ble |7] 

We find that the optical depth of the 700 km s"' CGM is 
comparable to but slightly exceeds the optical depth in the 
300 km s"' CGM. This is as expected because the 300 km s"' 
CGM includes the majority of the excess absorption; however, 
there is a small additional excess between 300 and 700 km s"', 
see lRudie et alJ (l2012h . For subsequent calculations, only the 
700 km s~' CGM will be considered since it captures the full 
optical depth of the CGM for d > 50 pkpc. 

5.3. The Distribution of Optical Depths in the IGM and CGM 

With an estimate of the opacity from all A^hi absorbers, one 
can easily calculate the fractional contribution of the CGM vs 
the IGM, as well as the contribution from absorbers of differ- 
ent A^Hi, to the mean free path. For this simplified calculation, 
we define the mean free path, Aj^j'"^, to be the average distance 
an ionizing photon with energy 1 Ry travels in a static uni- 
verse before having a probability of 1 -e"' of being absorbed. 
More quantitatively: 



>^ = ^An 



*mfp 






(21) 



'^ This is because tlie closest galaxy to a QSO sightline in the KBSS sam- 
ple is at a projected physical distance of 50 kpc. 
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Figure 9. The fraction of the optical depth within 1 mean free path 
contributed by absorbers within a given bin of A'hi in the IGM (green) 
or 700 km s"' CGM (blue). The solid histograms represent measure- 
ments from the data converted into an optical depth using equations 
[12] and [15] The hatched histograms represent values from the fit to 
the data extrapolated to higher values of A'hi and then converted into 
an optical depth using equations 1121 and [T4| The printed numbers 
give the percentage of the opacity due to that A'hi bin. 
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Figure 10. The cumulative fraction of the optical depth within 1 mean 
free path contributed by absorbers below a given A'hi in the IGM 
(green) or 700 km s"' CGM (blue). The solid histograms represent 
measurements from the data converted into an optical depth using 
equations [12] and [TS] The hatched histograms represent values from 
the fit to the data extrapolated to higher values of A'hi and then con- 
verted into an optical depth using equations ll2land[T4l The printed 
numbers give the percentage of the opacity due to absorbers with A'hi 
less than or within the A'hi bin. 



where 



, KiAn = 1 



(22) 



Here we consider two possible mean free paths: (1) In the 
case of photons emanating from the ISM of a galaxy simi- 
lar to those in our spectroscopic sample (see Section |4), they 
must first traverse the CGM of the galaxy in which they were 
formed and then the general IGM. Here we refer to this mean 
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free path as ^^^p'^ icm+ccm-(^^ ^^^ second case is the more 
general one and the mean free path that has been considered 
by previous authors - the mean free path of photons travel- 
ing in the IGM, a quantity relevant to photons once they have 
escaped the CGM of whichever structure in which they were 
formed. This distance is referred to as A^"p"^jQ,y[. 

In the calculation that follows, we consider the contribu- 
tions to the opacity from the 700 km s"' CGM as well as the 
IGM. Under these assumptions: 



KCGMArcGM + 'iIGM(Aj^fpJ^icM+cGM ~ ArcGM) ■ 



1. 



(23) 



where ArccM = 2.87 pMpc as given in equation |20] using 
dv = 700 km s"' , and kigm = 0.008 1 pMpc"' and kcgm = 0.047 
pMpc"' are the total opacities over all A^hi- Solving equation 
we find A!^l'''jf;^^^cGM= 109.3 pMpc. When we exclude 



\ static 
^mfp, IGM" 



122.8 



mfp, 

the CGM component, we instead obtain 
pMpc, a 12% longer pathlength. 

With A^^fp'^jQ]y[^(-Q|y[ known, we can then calculate the frac- 
tional contribution to the optical depth from each bin of A^hi 
for both the IGM and CGM as shown in Figure |9] and Table |7] 
(middle columns). The cumulative version of this plot (i.e. the 
contribution to the optical depth for all A^hi< Ni) is given in 
Figure[TO]and in Table|7] right-hand columns. From these, we 
can see that the CGM accounts for 13% of the opacity within a 
'^mfpI'iGM+CGM- Farther, absorbers with log(A^Hi/cm"2) < 17.2 
contribute 48% of the opacity to LyC photons. 

In practice, the detailed distribution of opacity as a func- 
tion of A^Hi will be altered when photons of various energies 
are considered. The true A,nfp of LyC photons will be non- 
negligibly affected by the redshifting of photons to lower en- 
ergies, and by the presence of higher-energy radiation. For in- 
stance, 100 pMpc corresponds to Az w 0.25 at these redshifts. 
Photons emitted by a source at (z) =2.4 with energies close 
to the Lyman Limit therefore will redshift to non-ionizing en- 
ergies long before they are attenuated by e. 

In the next section, we consider a more rigorous calcula- 
tion of Amfp including the aforementioned effects using Monte 
Carlo simulations of discrete lines of sight. As we will show, 
a more proper treatment of the physics does not substantially 
affect the value of Amfp. 

6. THE MEAN FREE PATH TO HYDROGEN-IONIZING PHOTONS 

As we have shown in Section |5] the mean free path of LyC 
photons is a cosmological distance at z = 2.4. As such, the 
consideration of photons with energies > 1 Ry as well as in- 
clusion of redshifting of photons out of ionizing frequencies 
are necessary in order to measure the true A,nfp. It is also nec- 
essary to define very precisely what one means by the 'mean 
free path,' including the frame of reference in which the dis- 
tance is measured. Here we define the mean free path, A^tp, 
as the distance traveled from a source by a packet of photons 
before photons with energy of 1 Ry in the absorber's^^ frame 
are attenuated by an average factor of e. This distance is fun- 
damentally related to a measurement of Az, which we convert 
to the physical distance traveled by the photon through the ex- 
panding universe. 

We perform a Monte Carlo (MC) simulation of the LyC ab- 
sorption along many individual sightlines. For illustration, we 

'^ We consider the energy of the photon at the absorber's redshift because 
only photons with energy sufficient to ionize hydrogen when they intersect 
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Figure 11. The average transmission spectra from 10,000 runs of 
the Monte Carlo simulations of IGM only (purple and blue) and 
IGM+CGM(red and orange) absorption in the foreground of a z = 2.4 
emitter shown in the rest frame of the emitter The solid (blue and 
red) curves show spectra using MC simulations that include line 
opacity. The dotted (purple) and dot-dashed (orange) curves show 
spectra from simulations only including continuum opacity. The 
highlighted yellow area marks the region between 920 -1015 A in 
the frame of the emitter which is used to renormalize the line opacity 
spectra in Figure [12] The dashed horizontal lines mark the aver- 
age value of the transmission in this region, {1-Db}, for the spectra 
containing line opacity. The vertical dashed line marks the position 
of the Lyman limit in the frame of the emitter The line absorption 
seen in the red spectrum is due to the Lyman series line opacity from 
high-A'ai CGM absorbers. 

begin with a discussion of MC runs with an emission redshift 

ofz = 2.4. 

6.1. Monte Carlo Simulations 

To simulate absorption within intergalactic and circum- 
galactic space, we produced artificial absorption spectra 
whose absorption line distribution is matched to that observed 
in data at the same redshifts; such models are described in de- 
tail by Shapley et al. (2006). In this case, we produced 10,000 
simulated lines of sight to sources with Zem = 2.4, according 
to a frequency distribution of absorbers set using best-fit pa- 
rameters from the broken power la w fit to f(N,X) (Table|5]l. 

As was demonstrated in Section 15.11 various fitting meth- 
ods produce largely discrepant values for the parameters of 
the analytic form of f(N,X). The actual opacity represented 
by the data can be measured to significantly higher precision 
then the parameters of the analytic form of f(N,X). For this 
reason, using perturbed values of the power law parameters 
will result in significantly larger variation in the opacity along 
a line of sight than occur in the actual universe as measured 
by the data. As such, we use only the best-fit values of the 
power law fit to f{N,X) in the Monte Carlo. 

The simulated absorbers are distributed throughout the red- 
shift range of interest in such a way that they reproduce the 
overall redshift distribution of measured absorbers. The red- 
shift distribution of absorbers is typically parameterized as: 



-^ = no(l+z)^. 
az 



(24) 



an absorber are relevant to A, 



mfp- 



For the MC simulations, we have assumed that the Lyman- 
a forest absorbers with Iow-A^hi (those absorbers fit by the 
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Figure 12. The average normalized transmission below the Lyman 
limit of a z = 2.4 emitter. The red and orange curves pertain to 
transmission including the CGM and the blue and purple curves 
pertain to IGM only transmissions. The dotted (purple) and dot- 
dashed (orange) lines show the same spectra as in Figure [TT] from 
the continuum opacity only runs of the MC code. The solid (red 
and blue) curves show the spectra including line opacity normalized 
by (I-Db). The black vertical dashed line marks the position of 
the Lyman limit in the frame of the emitter. The dashed horizontal 
line shows where the normalized transmission is equal to e"' . The 
colored vertical lines mark the position where each of the spectra 
crosses the e"', signifying the position of one Amfp. These rest wave- 
lengths are converted into physical distances corresponding to the 
Anifp as described in the text. 

low-A^Hi power law in the broken power law parameterization: 
log(A^Hi/cm"^) < 15.1 for the IGM andlogCJVm/cm "^) < 14.9 
for the CGM ) evolve with 7 = 2.5 ( Kim et alj2002h . We have 
verified that the line sample presented in this paper exhibits 
evolution consistent with these measurements. All high-A^Hi 
absorbers are assumed to have 7 = 1.0; the va lue f or the LLS 
compilation reported in this paper (Section 13. 3t measured 
through maximum likelihood estimation is 7 = 0.93 ± 0.29. 

Briefly, we note that the redshift evolution for LLSs within 
the range 2 < z < 3 is well constrai ned by t h e data pre- 
sented by [ Ribaudo et al. (2011), Sarg ent etaH (119891) . and 
IStengler-La rrea et al.l (11995) as discussed in Section 13.31 
Mea surements of the e volution in the line density at z > 3 (see 
e.g.. lSongaila & Cow ie 2010; Prochaskaet al. 2010) suggest 
much steeper evolution with redshift (high values of 7). Be- 
cause our Monte Carlo simulations only include absorption 
^t 2 < z < 3, we use the value of 7 = 1 as measured over 
the relevant redshift range and emphasize that the exact value 
of 7 does not significantly affect the quoted A^fp within the 
redshift range 2 < z < 3. ^* However, we caution that the ex- 
trapolations of the trend of our Amfp measurements to z > 3 is 
much more dependent on the chosen value of 7 and is there- 
fore unlikely to predict the true value at higher redshift. 

An additional set of simulations was run to mimic lines of 
sight emanating from galaxies like those in the KBSS. In these 
simulations, H I absorbers were added within 700 km s"' of 
Zem according to the CGM broken power law fit to f(N,X). 
The remainder of the line of sight is drawn from the IGM dis- 

'* Considered in the redshift range 2.0 < z < 2.8, varying 7 by Icr results 
in less than a 6% change in the incidence of LLSs and therefore less than a 
3% change in A^fp- 



tribution. These simulations produce the expected attenuation 
due to the CGM for sources within galaxies similar to those 
in our spectroscopic sample (0.25 < L/L* < 3, (z) = 2.3). 

In addition to the simulated forest spectra that include both 
line and continuum opacity, we created another set where the 
opacity from individual Lyman lines is not included, leaving 
only Lyman continuum absorption. To measure the A,nfp, the 
principal concern is the fraction of photons absorbed while 
they are at ionizing energies. Once photons redshift out of 
the hydrogen-ionizing band, for the purpose of calculating the 
mean free path, their subsequent absorption or transmission is 
immaterial. For this reason, spectra with line and continuum 
opacity are useful for correcting observed objects for attenu- 
ation along the li ne o f sight through the IGM (as will be dis- 
cussed in Section|63|i, while the "continuum absorption only" 
spectra can be used to measure Amfp directly (as discussed in 
Section [6.2l i. 

Figure [TT] shows the average of 10,000 realizations of each 
Monte Carlo run plotted in the rest-frame of the emitter The 
solid curves show the runs with Lyman line opacity included, 
and the dotted and dash-dotted lines show the MC runs that 
only consider the continuum opacity. The various colors per- 
tain to either IGM ONLY (blue and purple) or IGMh-CGM 
(red and orange) runs (note the presence of Lya, /3, 7, etc. ab- 
sorption due to the CGM of the emitter in the red spectrum). 
Note that at the Lyman limit, the spectra that include CGM 
absorption show a sharper drop in transmission. This is due 
to LyC absorption from high-A^ni absorbers found in the cir- 
cumgalactic gas. 

6.2. Measuring Amfp 
To measure Amfp from the Monte Carlo calculation, we use 
the spectra that include continuum absorption only.'^ We 
search the average spectrum for the location where the LyC 
only spectra reach a transmission equivalent to e"', as marked 
by the vertical dash-dotted lines in Figure [12] The rest wave- 
lengths from Figure[T2]can be converted into the physical dis- 
tance traveled by a photon between Zem = 2.4, the emission 
redshift of the spectra, and Zmfp where: 



Ac-All,, , . , 

Zmfp = ^ (f +Zem) + Ze, 

All 



(25) 



where All = 91 1 .75 A is the wavelength of the Lyman limit 
and Xe is the rest wavelength of the spectrum at which the 
transmission equals e"'. To calculate Amfp, we then compute 
the physical distance traveled by the photon through the ex- 
panding universe, given by the integral over the proper line 
element: 

/■Zem 1 



where c is the speed of light and 



H(z) = Hq V^a + ^M+z)^ 



(27) 



Using the averaged LyC only spectra shown in Figure [12] 
we calculate the Amfp through the IGM assuming Zem = 2.4 
to be 147 pMpc (or Az = 0.359). If instead the CGM is 
included (as would be the case for LyC photons emanat- 
ing from a galaxy), the estimated Amfp falls to 120 pMpc 

'^ Line opacity only reduces the transmission of non-ionizing photons 
which are unimportant for consideration of An,fp. 
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(Az = 0.301), --20% shorter. Note that the values of Amfp 
found in this sections using the MC simulations are similar to 
those estimated using the analytic approximations in Section 
E](A^fp'iGM= 122.8 pMpc, X^= 109.3 pMpc). This provides 
confidence that the conclusions of Section |5] hold even when 
a more rigorous treatment is considered. 

6.3. Computing the mean free path uncertainties 

To calculate the uncertainties in the A^fp measurement 
made in Sections 16.21 we must estimate the error in 
the opacity that originates from the Lya forest absorbers 
(log(A^Hi/cm-2) < 17.2) and from LLSs (log(A?Hi/cm-2) > 
17.2). The errors associated with each must be estimated sep- 
arately because the constraints on the incidence of these ab- 
sorbers come from different surveys. 

To estimate the uncertainty associated with LLSs, we return 
to the measured value of dnu^^/dX = 0.52 ± 0.08 (Section|5]l. 
Using the method outlined in Section |5] we employ a fit to 
the frequency distribution to calculate the opacity, k. Once 
again, we assume that an extrapolation of the broken power 
law fit to the slope of the high-A^ei absorbers holds in the LLS 
regime. We then calculate the normalization of the frequency 
distribution that reproduces dnu^s/dX and its upper and lower 
limits. These normalizations and the assumed slope can then 
be used to measure klls and an approximate error We find 
klls = 0.0043 ± 0.0006 Mpc"', a 15% uncertainty. If we al- 
low the value of /3 to vary within the \-a confidence region 
(see Table |6]l, the uncertainty in our estimate of klls would 
be larger, -20-25%. 

To estimate the A^fp uncertainties from absorbers with 
log(A^Hi/cm"^) < 17.2 we employ the measurement of Kdata 
from Section |5] We thus assume that the uncertainty is dom- 
inated by sample variance rather than the uncertainties in the 
model. However, before using this value, we verify that the 
opacity (and its uncertainty) from log(A^Hi/cm"^) < 17.2 ab- 
sorbers measured from the data are reproduced in the Monte 
Carlo simulations using the assumed power-law fits. 

To estimate the opacity in the MC runs from forest ab- 
sorbers, we ran another set of MC simulations of LyC only 
opacity. In this case, we included only absorbers with 
log(A^Hi/cm"^) < 17.2, excluding the contribution from LLSs. 
Using the ensemble of model spectra, we performed a jack- 
knife test with a sample size of 15 sightlines, designed to 
match the number of lines of sight in the KBSS QSO sam- 
ple. Each set of 15 randomly-drawn spectra was averaged and 
the maximum attenuation with respect to the continuum was 
measured and recorded. The rest-frame wavelength at the po- 
sition of maximum attenuation was converted into a proper 
distance at the emission redshift using equation |26] The av- 
erage value of K along those 15 lines of sight was taken to be 
the natural logarithm of the minimum flux divided by phys- 
ical distance. The redshift range used in the simulations is 
2.0 < z < 2.8 in order to reproduce the range of the KBSS 
QSO sightline sample. 

The jack-knife technique was applied 10,000 times to pro- 
duce a distribution of the forest opacity, Kfor, as shown in 
Figure [13] An estimate for the error in Kfor was then de- 
rived from the moments of the distribution. The resulting 
value of Kfoi = 0.0037 ± 0.0005 is in good agreement with 
'^data = 0.0039 ± 0.0005 calculated in Section |5] providing 
confidence that the MC simulations well reproduce both the 
mean value and the dispersion in Kfor for a sample of the same 
size as the KBSS. 



Table 8 

Comparison of the A,nfp derived from the LyC and the renoraiaUzed 
forest spectra 



^em 


Amfp LyC 


A,rfp(l-Ds) 


Amfp LyC 


Amfp(l-Os) 




[pMpc] 


[pMpc] 


[pMpc] 


[pMpc] 




IGM ONLY 


IGM ONLY 


IGM+CGM 


IGM+CGM 


2.2 


19L8 


185.9 


160.2 


151.4 


2.4 


146.5 


135.7 


120.0 


113.4 


2.6 


115.7 


106.8 


93.7 


85.6 


2.8 


91.9 


85.1 


73.5 


66.0 



With an estimate of klls, Ck.lls as well as Kfor and (TK.for, 
it is straightforward to calculate the error in the A^fp. If 



Amfp - 



1 



KLLS + Kfor 

(see equation l23Tl then the uncertainty on A^fp is 



<,-Ll.5 + < 



.for 



'7'mfp — . , T 

(KLLS + Kfor) 



(28) 



(29) 



where o"k,lls and cr^.tor are the uncertainties of klls and Kfor 
respectively. Employing these equations, we find a typical 
uncertainty in Amfp of — 15 Mpc, which we adopt for both 
the IGM and IMGh-CGM Amfp measurements. Since the con- 
tribution of LLSs to the opacity in the CGM is poorly con- 
strained, the error on the CGM measurements is likely some- 
what larger; however since the CGM contributes < 20% of the 
opacity, the effect on the net uncertainty is modest. We adopt 
this uncertainty for the measurement of Amfp at the mean red- 
shift of the sample. 

6.4. Simulated sightlines with line and continuum opacity 

For some applications, it is more useful to know the ex- 
pected attenuation along an ensemble of sight lines includ- 
ing line opacity. For instance, in studies of the escape frac- 



tion of LyC pho tons ('Sh aplev et al. 120061: llwata et al.1 



20091: 



iBridge et al.ll20T 0: Siana et al.ll20ldlNestor et 31.1120111 Stei- 
del et al. in prep) a correction must be made for the fraction 
of the attenuation at a given observed wavelength due to (non- 
ionizing) line blanketing. For these applications, our models 
including CGM attenuation are ideal. The sample of galaxies 
used in our CGM study have impact parameters of 50 - 300 
pkpc from the QSO line of sight, and so our MC simulations 
which include opacity from the CGM effectively model the 
typical gas distribution at radii larger than 50 kpc - and hence 
the attenuation suffered by photons after they leave the ISM 
of the galaxy, but before they leave the large gas densities as- 
sociated with the CGM. 

Using the sets of simulated spectra with Zem = 2.4 that 
include Lyman line opacity for the IGM ONLY and the 
IGMh-CGM, we rneasure the distribution of transmission of 
emitted 880-910 A photons through the IGM and the IGM 
+ CGM; see Figure [14] The transmission is generally lower 
along the IGMh-CGM sight lines. The peak in the transmis- 
sion PDF near — 0.75 corresponds to the average transmission 
within the Ly/3 forest which is < 1 due to line blanketing from 
both higher Lyman series lines as well as Lya absorption from 
lower redshift. 

In Figure[T2] the solid curves show the average spectrum of 
10,000 MC runs that include fine opacity. We use (V-Db), the 
average transmission in the region between 920-1015 A in the 
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Figure 13. The variation in the measured opacity in groups of 15 sightlines due to all absorbers with log(A'Hi/cm ~) < 17.2. The simulations 
used in this plot are those with emission redshift z = 2.8. These show the results of 10,000 jack-knife tests in which 15 spectra were randomly 
drawn from the "LyC ONLY" Monte Carlo runs in which only absorbers with log(A'Hi/cm"^) < 17.2 were included. See Section l631 The dotted 
vertical lines mark the 1 -cr uncertainties in «:. 
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Figure 14. The distribution of transmissions from each sightline averaged over the band pass from 880-910 A for the the IGM-l-CGM Monte 
Carlo simulations (left panel) and IGM only Monte Carlo simulations (right panel) that include line opacity. The simulations used for these 
figures are those with an emission redshift of z = 2.4. The (blue) dashed vertical line marks the average transmission in the Ly/3,7,+ forest 
{i-Ds). 



rest frame of the source (the region shown in the yellow box 
in Figure[TT] Oke & Korvcanskv 1982) to "re-normalize" the 
continuum against which the LyC opacity will be measured. 
Using the renormalized spectrum as an approximation to the 
"LyC o nly" MC spectra and measuring A^fp as described in 
Section l6T2l leads to A^tp values 5-10% smaller than the 
"true" values. Table [8] compares the results of the MC simu- 
lations including line opacity (columns labeled (1 -D^)) with 
those based on the continuum only spectra (columns labeled 
LyC). 

6.5. Comparison with previous Amfp measurements 

In the previous sections, we derived the value of the mean 
free path Amfp to LyC photons traveling through the IGM to 
be: 



If instead, we assume such LyC photons emanate from galax- 
ies similar to one of those in our spectroscopic sample, we 
find a value for A,nfp including the CGM opacity to be: 



IGMh-CGM: An,fp(zem = 2.4) = 121 ± 15 Mpc. 



(31) 



IGM ONLY: Amfp(zem = 2.4) = 147 ± 15 Mpc. 



(30) 



One important advance of the estimate of A^tp made in this 
paper is that all parts of the column density distribution have 
now been measured at the same mean redshift. This has not 
been possible previously due to the difficulty of collecting ap- 
propriate statistical samples for the various column density 
regions of the frequency distribution. 

In this section, we compare these values to previous esti- 
mates from the literature. One caveat is that the measurements 
made in this paper typically differ in mean redshift from the 
samples on which previous Amfp measurements have relied. 
As such, an extrapolation of the literature measurements or 
our own measurements are generally necessary in order to 
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compare the results, and therefore these comparisons are more 
dependent on the value of 7 assumed. Nevertheless, in this 
section we compare to previous measurements for complete- 
ness. Figure [15] gives a graphical summary of the results of 
this comparison. 

In general, the differences between our IGM ONLY Amfp 
results and those of other authors (especially those measured 
at similar z) underscore the degree to which Amfp is sensitive to 
changes in f{N,X), highlighting the importance of the precise 
measurement of the intermediate A^hi systems made here. 

One other caveat is the small degree of incompleteness 
which may be present in the z < 2.4, log(A^Hi/cm"^) > 15.5 
portion of our absorber sample. As described in Appendix IB] 
this bias will only act to shorten the inferred value of A^fp, 
pushing it to values farther below those inferred by other au- 
thors. 

The mean free path and its evolution with redshift are gen- 
erally parameterized as: 



Amfp = Amfp,0(l+Z) 



(32) 



where ^ captures the cosmology (as parameterized in equa- 
tionl26]l as well as the evolution in the number of absorbers as 
a function of redshift, dn/dz (described in Section l6Tt . For 
the Monte Carlo simulations described in this section, the red- 
shift evolution of Amfp is fixed by the assumed value of 7. We 
emphasize that the redshift evolution is only constrained by 
data with 2.0 < z < 3.0, and as such the extrapolation of our 
results above to z >> 3 is ill-advised. 

The redshift evolution of Amfp, parameterized as a function 
of the emission redshift of the QSO, Zem, returned by the MC 
simulations is represented by the solid blue curves in Figure 
[15] and is given by: 



IGM ONLY: Amfp(Zem) = 147 



and 



IGM+CGM:Amfp(Zem)=121 



l+Zen 

3.4 



l+Zen 

3.4 



-4.3 



pMpc (33) 



-4.5 



pMpc. (34) 



The first estimate of Amfp was made bv lMadau et alj (119991) 
who found: 



^.nfP=42(^] Mpc = 87f^ 



Mpc (35) 



using a simple approximation for f{N, X) with /3 = 1 .5 and 7 = 
2 where their Einstein-de Sitter cosmology has been converted 
to the cosmolog y used in t his pa per. 

Faucher-Gig uere et al.l (120081) published an updated es- 
timate to the Amfp assuming = 1.39 as measured by 
iMisawaetaP (120071) . iMisawaetaP (|2007 |) measured the fre- 
quency distribution close to high-A^ni absorbers, and thus their 
sample is similar in slope to our CG M f(N,X) distribution. 
Howev er, the normalization chosen bv lFaucher-Giguere et alj 
(12008) was computed using LLS statistics for the full IGM, 
thereby closely matching our LLS points but somewhat under- 
predicting the incidence of lower-A^ei absorbers resulting in 
slightly larger values of Amfp at a given z: 



Amfp - 85 



— J Mpc: 



160 



-jj) Mpc 



(36) 



iProchaska et al.l (l2009h estimated Amfp using stacked QSO 
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Figure 15. The Amfp as a function of Zsm including previous esti- 
mates of the Amfp from the literature. The blue points and curves 
refer to values measured in this paper whereas estimates from the 
literature are plotted in shades of red. The blue cross and X and 
their corresponding curves show the values of Amfp calculated from 
the MC spectra for the IGM ONLY and IGM+CGM runs respec- 
tively. The dotted and dashed blue horizontal line show the results 
of the analytic calculation in Section [5] excluding and including the 
CGM respectively. The dark red filled star and the dash- dotted curve 
that in tersects it are the suggested values of Amfp from lMadau et alj 
(Il999h. The red fi lled circle an d dashed curve are the results of 
iFaucher-Giguere et al.i (i2008i) . iProchaska et al.l ( 120091) suggested the 
values shown wi th the red/orange as t erisks and the dotted line. The 
measurements of ISongaila & Cowi3 (12010') are shown in the orange 
fil led square and t he dash-dotted curve that intersects it. The results 
of lO'Meara et alj f2012t) are shown in the orange filled triangle. 

spectra to measure the LyC opacity. Notably, such a method 
would include any contribution from the CGM of their QSOs. 
They found 

Amfp = 48.4-38.0(z-3.6) Mpc = 94-38.0(z-2.4) Mpc (37) 

somewhat lower than our measurement at z = 2.4 and with 
very different evolution with redshift than indicated by our 

data. 

ISongaila & Cowi j (12010 ?) revisited the mean free path cal- 
culation after compiling a large sample of z < 1 and z > 4 
LLS. They found 



Amfp = 50 



1+z 
4.5 



-4.44: 



+0.36 
0,32 



-4.44 



+0.36 
0.32 



Mpc = 170 ( —J 1 Mpc 

(38) 

using f3 = 1.3. They also offer a variety of normalizations for 
different /3 values, including 130 Mpc for /3 = 1.5 at z = 2.4, 
in general agreement with our measured "IGM ONLY" value 

of Amfp. 

Recently lO'Meara et al.l (12012ft presented a calculation of 
Amfp using their new measurements of the incidence of LLSs 
at z '^ 2. Their estimates of the incidence of absorbers with 
log(A^Hi/cm~^) < 17.2 leads to a lower value for the opacity 
as described in Section lSTj They found 



Amfp = 250 ±43 Mpc 



(39) 



at z = 2.44, nearly a factor of 2 larger than our estimate and 
significantly larger than the extrapolations of others' measure- 
ments at higher redshifts. 
We note that for a steeper evolution in redshift as found 
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in lSongaila&Cowid (120101) and lProchaskaetalJ (120101 ). the 
general agreement between the extrapolation of the presented 
measurements at (z) = 2.4 to z > 3 would result in a more 
discre pant value significantly be low the measurements at z > 
3.5 by Songaila & Cowid (2010) and Procha ska et al.l (I2009h . 

To summarize, the calculation described in Section |6] of- 
fers three significant improvements over previous measure- 
ments. First, all portions of f(N,X) have been measured at 
the same mean redshift, avoiding the need to extrapolate any 
part of f(N,X) to a different redshift using uncertain values 
of 7. A second advance is the accuracy of our Voigt pro- 
file fitting technique, achieved by including higher-order Ly- 
man series transitions, yielding the best measurement of the 
power law slope, (3, for absorbers with saturated Lya profiles, 
[14 < log(A^Hi/cm"^) < 17.2]. Because such absorbers ac- 
count for ^half of the opacity to LyC photons in the IGM, un- 
derestimating their contribution would result in significantly 
underestimating the total opacity of the IGM. 

The third major improvement with respect to previous stud- 
ies is the inclusion of absorption from the CGM which again 
decreases the measured value of A^fp. If LyC photons are 
presumed to originate within galaxies similar to those in the 
KBSS sample, then every hydrogen-ionizing photon mu st firs t 
propagate through the CGM. As shown by Rudie et al. ( 20121) . 
regions of the IGM near galaxies have a much higher inci- 
dence of high-A^Hi absorbers than the average IGM with the 
effect (again) of reducing Amfp relative to previous measure- 
ments. 

6.6. Implications of the higher opacity of the IGM+CGM 

A shorter mean free path has implications for the emissiv- 
ity and demographics of ionizing sources at high redshift, as 
well as for calculations of the metagalactic UV radiation field. 
Briefly, if one consi ders the spe cific intensity of the UV back- 
ground {Ju, see e.g. [ Scott et al. 20 0fl) or the photoionization 
rate (F, see e.g. iFauch er-Gi guere et al.ll2008l) to be well deter- 
mined, a shorter mean free path directly implies a requirement 
for corres pondingly lar ger ionizing emissivity of sources (e^). 
FoUowing lFaucher-Gi guere et al.l (l2008h . 



Juiz) w — Amfp(z/,z) eAz) 



and 



F oc 7i,. 



(40) 

(41) 

Conversely, for a measured value of e^ from a given popu- 
lation of sources, a shorter An,fp implies a smaller contribu- 
tion to both the specific intensity and the photoionization rate 
(F). Higher intergalactic opacity also requires that the back- 
ground be produced by sources within a smaller volume, pos- 
sibly leading to greater spatial variation in the radiation field 
intensity. 

LyC escape fraction studies have often found very few 
detections of ioniz i ng photons escaping from galaxies 
(iShaplev et al.1 I2OO6I: iBridge et all I20T0I: iSiana et alJ l20Tot 
however see llwata et al.1 120091 iNestor et al.ll201 ll and Stei- 
del et al. in prep). For LyC studies of galaxies at high-z, 
the inclusion of the CGM is an important effect. Consider- 
ing Figure [14] note that the number of galaxies with ^zero 
transmission increases by a factor of 2 when the CGM is in- 
cluded. If the redshift evolution of the CGM is similar to that 
measured for the IGM, the effect of the CGM will become 
more pronounced at higher-z where the incidence of high-A^Hi 
systems in the CGM will be much higher. 



Higher IGM and CGM opacities mean that there may be 
more sources contributing to the metagalactic UV radiation 
field than are implied by the number of actual LyC detec- 
tions; on the other hand, the assumption of a Amfp that is 
too large may cause one to over-estimate the contribution 
to Ji, or F made by a population of detected LyC sources. 
The latter effect may reduce the tension between the appar- 
ent ov er-production of io nizing photons by observed LBGs at 
z ~ 3 (iNestor et al.l201 ih relative to estima tes of the total pho- 
toionization rate produced by all sources (iBolton et al.l 1200 51 : 
iFaucher-Giguere et a l. 2008). 

7. SUMMARY 

Using a sample of 15 high-S/N, high-resolution QSO spec- 
tra drawn from the KBSS, we have produced the largest 
catalog of H I absorbers fit with Voigt profiles to Lya and 
at least one higher order Lyman series transition. This 
analysis enables the first statistically rigorous measurements 
of the frequency distribution of H I absorbers with 14 < 
log(A^Hi/cm"^) < 17. In Section [3] we showed that the fre- 
quency of absorbers as a function of A^hi is well-parameterized 
by a single power-law from 13.5 < log(A^Hi/cm"^) < 17.2 
with a maximum likelihood index of /3 = 1 .65 ± 0.02 and a 
normalizationper unit pathlength of log(CHi) = 10.32. 

In Sectionlll we measured the frequency distribution within 
300 pkpc and both 300 km s"' and 700 km s"' of galaxies in 
the KBSS sample. We showed that the frequency of absorbers 
near galaxies is significantly higher and that the power-law 
index is shallower compared with that of the IGM, meaning 
there are disproportionately more high-A^ni systems than low. 
These findings are discussed further by Rudie et al. (2012). 

Section|5]examined statistically the total opacity of the IGM 
and CGM to hydrogen-ionizing Lyman continuum (LyC) pho- 
tons. This section presented measurements of the opacity 
due to absorbers with log(A^Hi/cm"^) < 17.2 with direct con- 
straints from the data. We also found that a single MLE 
power-law fit over the full range of A^hi does not well repro- 
duce this opacity, and also fails to reproduce the incidence 
of LLSs. We therefore adopted a broken power law fit that 
reproduces both the opacity measured within the data from 
absorbers with log(A^Hi/cm~^) < 17.2 as well as the observed 
incidence of LLSs. The parameters of these fits can be found 
in TableH 

We also measured the fractional LyC opacity in bins of 
A^Hi- We found that 48% of the opacity within one mean 
free path, Amfp, is contributed by absorbers with A^hi< 10'^'^ 
cm"^, the vast majority of which results from absorbers with 
14 < log(A^Hi/cm"^) < 17.2 whose frequency was poorly mea- 
sured prior to this work. 

In Section |6] we used Monte Carlo simulations to mea- 
sure the value of the mean free path (A^tp) of LyC photons at 
(z) = 2.4. We measured both the Amfp of photons through the 
IGM (as has been considered many previous times) as well as 
the Amfp of photons formed in galaxies similar to those in the 
KBSS - photons that must first transverse the CGM of these 
galaxies before reaching the lower-opacity IGM. We found 
Amfp(z = 2.4) = 147 ± 15 Mpc for simulations including only 
IGM opacity and Amfp(z = 2.4)= 121±15 Mpc when we in- 
clude the CGM. These values of Amfp are lower than most pre- 
vious estimates, which have not included the CGM and have 
generally underestimated the contribution of intermediate-A^ni 
absorbers. We also quantified the effect of the CGM on the 
distribution of transmission through random sight lines, rele- 
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vant to measurements of the escape fractions of LyC photons 
from such galactic sources at high-z. 

We note that the A^fp value including attenuation from the 
CGM is relevant only if the dominant sources of LyC photons 
are galaxies similar to the UV color-selected galaxies in our 
sample (0.25 < L/L* < 3.0 at (z) = 2.3) - for other galaxies, 
these measurements are an approximation. However, in all 
cases the general statement holds that A^fp and related statis- 
tics are affected by the ^Mpc-scale gaseous environments of 
the sources that contribute significantly to the metagalactic 
background. 

The IGM and CGM opacities measured in this paper have 
significant implications for studies of ionizing sources at high 
redshift and for estimates of the metagalactic UV background 
atz«2-3. 
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APPENDIX 
A. EITS TO H1GH-Whi ABSORBERS 

As discussed in ^ for log(A^Hi/cm"^) > 14.5, the Lya transition is saturated, complicating the measurement of absorbers with 
higher A^Hi from Lya only spectra. Similarly, for absorbers with log(A^Hi/cm"^) > 15.5, the Ly/3 transition saturates. Thus, the 
precision and accuracy with which high-A^Hi systems can be measured will depend to some degree on the number of accessible 
Lyman lines, thus imposing some redshift dependence. 

For the sample presented in this paper, the QSO spectra cover the Lya and Ly/3 transition for all the absorbers. Given the typical 
spectral coverage of our sample (see Table [T}, for absorbers with z > 2.3, the Ly7 transition is also covered, and for those with 
z > 2.4, Lya, /3, 7, (5 are generally observed. For absorbers with z > 2.5, the spectral coverage of the QSOs typically allows for 
observation of transitions all the way to the Lyman Limit, thus allowing for accurate measures of A^hi to log(A^Hi/cm"^) < 17.2. 

An additional complication arises from the fact that high-A^ei absorbers are often found in blended systems of several high-A^ei 
absorbers. Such absorbers, when observed in only Lya,/3, often cannot be de-blended. The incidence of such blends increases 
with increasing redshift as the density of the forest increases. This blending effect is mitigated in our sample at z > 2.4 due 
to access to addition Lyman series lines, but may go unrecognized in the low end of the redshift interval (z < 2.4) which lack 
optically thin Lyman lines. For such blended systems in the presented sample with z < 2.4, the number of subcomponents may 
be underestimated which may lead to some degree of incompleteness in the sample. 

In this appendix we assess the accuracy of our fits to absorbers with log(A^Hi/cm~^) > 15.5. 

A. I. Examples of fits to high-Nm absorbers 

Figures [T6l - [TS] show examples of the fits to the HIRES spectra for absorbers with 15.5 < (log(A^Hi/cm"^) < 17.0 . In each 
case, the HIRES data are shown in black, and the solid red curve shows the model with the best fit to the data. The dashed 
curves show the ±3cr formal errors on A^hi as reported by VPFIT. As expected, the errors on A^hi are dependent on the number 
of transitions observed; however, the dependence is not trivial as the effect of (a) blending from other proximate absorbers, (b) 
the contamination of higher-order Lyman transitions due to unrelated absorbers, and (c) the S/N of the spectrum in the areas of 
interest all affect our ability to accurately determine the parameters of the absorber 

A. 2. Testing fits with only Lya and Lyj3 

To assess theaccuracyof fits to absorbers with log(A^Hi/cm"^) > 15.5 observed only in Lya and Ly/3, we use the set of absorbers 
with log(A^Hi/cm"^) > 15.5 for which n > 5 Lyman transitions were measured, generally drawn from the subsample with z > 2.4. 
We then attempt to refit these absorbers using VPFIT with constraints from the Lya and Ly/3 region only. The outcomes of these 
tests can be generalized into two principal categories: highly-blended and less-blended absorbers. 

Absorbers for which a portion of one of both edges/wings of the absorption line can be observed in Lya and/or Ly/3 can be fit 
much more reliably. Such systems, when refit with VPFIT using only their Lya and Ly/3 transition result in weaker constraints 
on the line parameters as expected, but result in A^hi determinations consistent within their formal errors with those obtained via 
the higher-order transitions. The fits are not systematically offset to higher or lower values of A^hi, and therefor should not bias 
the measurements of f{N,X) in any large way. 

For absorbers whose Lya and Ly/3 transition are fully saturated and both edges/wings of the absorption line cannot be observed 
due to other neighboring absorbers (e.g. see the left-most panel in Figure [17] and the center panel in Figure [18), essentially 
no information is available on the sub-component structure of the system. Such systems are often fit with a minimum of two 
components because it is rare that the wings/edges of the blended system are symmetric. In the absence of higher-order Lyman 
lines, the number of subcomponents is typically underestimated, likely leading to some degree of incompleteness in the lower- 
redshift portion of the catalog. There would also be a tendency to underestimate the total A^hi in such blends. If the A^hi of 
individual absorbers in the blends were systematically underestimated, this would bias the measurement of f{N,X) to have 
steeper values of /3 than the true value. 

In conclusion, the lack of higher-order constraints on the lower-z saturated absorbers have two principal effects on our mea- 
surements. (1) Relatively unblended absorbers yield somewhat weaker constraints on the parameters that are unlikely to bias 
the measurement of f{N,X). (2) Highly-blended absorbers yield very weak constraints from Lya and Ly/3 only and therefore 
are typically fit with fewer subcomponents and with lower-A^Hi values than would be assigned using additional Lyman series 
constraints. This results in some degree of incompleteness in the lower-z portion of the catalog. 
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Figure 16. Example fits to absorbers with 15.5 < logiNm/crcT") < 16.0. The black curves show the HIRES data. Grey dashed lines mark the 
zero point and continuum of the QSO spectra. The solid red curve shows the best fit Voigt profile to each absorber. The dashed red curves show 
the formal ±3a Nm error. 

A.3. Assessing Incompleteness in the z ^ 2.4, Zo^(A^Hi/cm~^) > 15.5 Catalog 

To estimate the degree of potential incompleteness in the lower-z portion of the catalog due to line blending, we consider the 
fraction of the higher-z, high-A^ei absorbers observed in « > 5 Lyman series transitions that would be unconstrained or miscounted 
with observations of Lya and Ly/3 alone. This method likely overestimates the incompleteness as the degree of line blending and 
contamination is higher in the high-redshift forest than in the lower-z forest due to evolution in the line density as a function of 
redshift. Nevertheless, we report the results of this test, considering separately absorbers in half-dex bins of A^hi- The results are 
summarized in Table |9] 
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Figure 17. Example fits to absorbers with 16.0 < logiNui/cmT^) < 16.5. The black curves show the HIRES data. Grey dashed lines mark the 
zero point and continuum of the QSO spectra. The solid red curve shows the best fit Voigt profile to each absorber. The dashed red curves show 
the formal ±3a Nhi error. 

For absorbers with 15.5 < log(A^Hi/cm"^) < 16.0, 23 are measured with 5 or more Lyman series transitions. Of these, only 2 
would have essentially no constraint on the parameters of the absorber from Lya and Ly/3 alone (e.g. Figure[T6] second from the 
left). For 7/23 of these systems, the constraint on A^hi would be relatively weak, however, an individual component associated 
with that absorber can be clearly identified. All 7 of these systems are fit with much improved accuracy with the additional 
constraint from Ly7. Among absorbers with 16.0 < log(A^Hi/cm~^) < 16.5, we observed 12 absorbers with constraints from 5 
or more Lyman series transitions. Of these, only 4 are strongly constrained from Lya and Ly/3 alone. With Ly7, 8/12 could be 
fit, and with Lya,/3,j, 6, 1 1/12 would be strongly constrained and could accurately be fit. For absorbers with Nm just below that 
of LLSs (16.5 < log(A^Hi/cm"^) < 17.0), we observe 5 systems in 5 or more Lyman series transitions.'^ Of these, only 2 would 



' All five absorbers are observed in a minimum of 9 transitions. We do not observe any absorbers in this A'm range with 5-8 Lyman series transitions observed. 
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Figure 18. Example fits to absorbers with 16.5 < logiNm/crnT^) < 17.0. The black curves show the HIRES data. Grey dashed lines mark the 
zero point and continuum of the QSO spectra. The solid red curve shows the best fit Voigt profile to each absorber. The dashed red curves show 
the formal ±3ct A'hi error. 



IGM and CGM Opacity at (z) = 2.4 
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Table 9 

Constraints on Nm vs. the number of observed Lyman series transitions 



logiWm/cm ^) 


Lines considered 


# with 5+ 


strongly 


constrained 


weakly 


constrained'' 


unconstrained*^ 




in the fit 


Lyman 


series lines" 


# 




% 


# 


% 


# 


% 


15.5 - 16.0 


Lya,/3 




23 


14 




61% 


7 


30% 


2 


9% 


15.5 - 16.0 


Lyo,/3,7 




23 


18 




78% 


3 


13% 


2 


9% 


16.0-16.5 


Lya,/3 




12 


2 




17% 


2 


17% 


8 


67% 


16.0-16.5 


Lya,/3,7 




12 


2 




17% 


6 


50% 


4 


33% 


16.5-17.0 


Lya,/3 




5 


1 




20% 
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40% 
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40% 


16.5 - 17.0 


Lya,/3,7 




5 


3 




60%) 





0% 


2 


40%- 



" The number of absorbers in the catalog with A^hi in the range listed and at least 5 Lyman series transitions observed. 

^ For weakly constrained absorbers, the transitions listed give unambiguous evidence of the presence of an absorber, and 

typically have at minimum one edge of one transition well observed without a significant blend. 

"^ For unconstrained absorbers, the transitions listed are fully saturated and blended with other high-Wm lines (e.g. see middle 

panel Figure [TH meaning this absorber would likely not be counted or be very poorly fit in data with only the transitions Hsted. 
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Figure 19. The frequency distribution measured from the higher-redshift (red) and lower-redshift (blue) portions of the absorber sample (many 
of the poin ts from the two subsamples lie on top of each other). The grey curve shows the broken power law fit to the full absorbers sample 
from EI] 



have no constraint on A^hi from Lya and Ly/3 alone. 

From this assessment based on higher-z measurements, we expect that the principal bias in our measurements of A'hi for high- 
A^Hi systems for absorbers with only Lya and Ly/3 is to under-count the number of systems with log(A^Hi/cm"^) > 15.5. Within 
the present sample, systems with 16.0 < log(A^Hi/cm"^) < 16.5 appear to be most-affected by blending and may therefore be 
the most incomplete. In the next appendix, we add further evidence to this claim by comparing the measurement of f{N,X) for 
absorbers in two redshift subsamples. 

B. THE EFFECT OF THE ACCURACY OF FiN,X) ON THE MEASUREMENT OF THE Anrfp 

Here we assess the redshift dependence of the frequency distribution presented in ^ and ^ We split the absorber sample 
roughly in half and compare those absorbers with z < 2.4 and those with z > 2.4. For the higher-redshift subsample, the spectra 
typically cover at least 4 Lyman series transitions and most of this sample have all the Lyman series transitions observed. Figure 
[T9] shows the results of this comparison. Notably, the higher-redshift subsample (red points) is in reasonable agreement with the 
broken power law fit (gray curves) presented in § 15.1! that were used in the Monte Carlo simulation in ^ Further, the higher- 
redshift sample, tends to lie slightly above the fit. Some portion of this is likely due to the redshift evolution of the line density of 
the forest; however, it is also possible that f{N, X) o f absorbers with log(A^Hi/cm"^) > 15.5 is actually higher than that measured 
from the full sample. As discussed in Appendix |A.3l we expect that the lower-z catalog likely suffers from incompleteness due to 



line blending, especially for absorbers with 16.0 < log(A^Hi/cm"^) < 16.5. In Figure[T9] we note the only bin for which the low- 
and high-z data do not agree within la contains absorbers with 16.0 < log(A^Hi/cm"^) < 16.5. 

The principal result of this paper is the relatively high measured value of f{N,X) for absorbers with 14.5 < (log(A^Hi/cm"^) < 
17.2 which results in a value of Amfp comparable to or smaller than previous measurements. If we were to use the high-redshift 
sub-sample from Figure [T9]to measure the opacity and A^fp, we would find higher IGM opacity and therefore a shorter (more 
discrepant) Amfp- In any case, if there is a systematic bias in the measurements, it is one that would conspire to make Amfp appear 
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larger than its true value. We therefore conclude that it is unlikely that a bias in the precision of our measurements of f(N,X) as 
a function of z would change the principal result of the paper. 



